Restrictive inverted terminal repeats for viral vectors

ABSTRACT

This invention relates to modified parvovirus inverted terminal repeats (ITRs) that do not functionally interact with wild-type large Rep proteins, synthetic Rep proteins that functionally interact with the modified ITRs, and methods of using the same for delivery of nucleic acids to a cell or a subject. The modifications provide a novel Rep-ITR interaction that limits vector mobilization, increasing the safety of viral vectors.

RELATED APPLICATIONS

This application is a divisional of U.S. patent application Ser. No. 13/521,448, filed Jul. 11, 2012, now U.S. Pat. No. 9,169,494 which claims priority to and is a 35 U.S.C. § 371 national phase application of PCT Application PCT/US2011/020939, filed Jan. 12, 2011 which claims priority to U.S. Provisional Application No. 61/294,181, filed Jan. 12, 2010. The entire content of each of these applications is incorporated herein by reference.

STATEMENT OF FEDERAL SUPPORT

This invention was made with government support under GM059299, HL066973, HL051818, AI072176 awarded by the National Institutes of Health and AI007419 awarded by the National Institute of Allergy and Infection Diseases. The government has certain rights in the invention.

STATEMENT REGARDING ELECTRONIC FILING OF A SEQUENCE LISTING

A Sequence Listing in ASCII text format, submitted under 37 C.F.R. § 1.821, entitled 5470-547_ST25.txt, 454,116 bytes in size, generated on Nov. 7, 2012 and filed via EFS-Web, is provided in lieu of a paper copy. This Sequence Listing is hereby incorporated by reference into the specification for its disclosures.

FIELD OF THE INVENTION

This invention relates to modified parvovirus inverted terminal repeats (ITRs) that do not functionally interact with wild-type large Rep proteins, synthetic Rep proteins that functionally interact with the modified ITRs, and methods of using the same for delivery of nucleic acids to a cell or a subject. The modifications provide a novel Rep-ITR interaction that may limit vector mobilization, increasing the safety of viral vectors.

BACKGROUND OF THE INVENTION

The adeno-associated viruses (AAV) are members of the family Parvoviridae and the genera Dependoviruses. Serotypes 1 through 4 were originally identified as contaminates of adenovirus preparations (Carter and Laughlin (1984) in, The Parvoviruses p. 67-152 New York, N.Y.) whereas type 5 was isolated from a patient wart that was HPV positive. To date, twelve molecular clones have been generated representing the serotypes of human/primate AAV (Bantel-Schaal et al. (1999) J. Virol. 73: 939; Chiorini at al. (1997) J. Virol. 71:6823; Chiorini et al. (1999) J. Virol. 73:1309; Gao et al. (2002) Proc. Nat. Acad. Sci. USA 99:11854; Mori et al. (2004) Virol. 330:375; Muramatsu et al. (1996) Virol. 221:208; Ruffing et al. (1994) J. Gen. Virol. 75:3385; Rutledge et al. (1998) J. Virol. 72:309; Schmidt et al. (2008) J. Virol. 82:8911; Srivastava et al. (1983) J. Virol. 45:555; Xiao et al. (1999) J. Virol. 73:3994). These clones have provided valuable reagents for studying the molecular biology of serotype specific infection. Transduction of these viruses naturally results in latent infections, with completion of the life cycle generally requiring helper functions not associated with AAV viral gene products. As a result, all of these serotypes are classified as non-pathogenic and are believed to share a safety profile similar to the more extensively studied AAV type 2 (Carter and Laughlin (1984) in, The Parvoviruses p. 67-152 New York, N.Y.).

General understanding of the mechanisms required for function at origins of replication has grown immensely since the first prokaryotic origins were characterized. While the DNA-protein interactions necessary for replication in prokaryotes, lower eukaryotes, and bacteriophages are generally well understood, mechanisms employed in the majority of higher eukaryotes and vertebrate viruses, such as AAV, are still being determined. The inverted terminal repeats (ITRs) of AAV and other Parvoviruses act as the origin of replication. These elements flank the short, single stranded genome and typically possess a T-shaped secondary structure. The replication strategies of the genus Dependovirus, including those of AAV, have been well characterized. The viral non-structural or Replication proteins (Rep) are the only factors required to interact with the ITR in order to catalyze replication (Im and Muzyczka (1990) Cell 61:447). The majority of AAV serotypes possess highly conserved origins of replication with interchangeable DNA-protein interactions. However, the Rep proteins of several serotypes interact exclusively with their cognate ITR. Discovering the mechanisms which drive Rep-ITR specificity promises to advance our understanding of DNA-protein interactions at viral origins of replication. These findings also promise to shed light on how eukaryotic and prokaryotic proteins achieve selectivity to DNA substrates.

The AAV rep gene encodes four multifunctional proteins (Hermonat et al. (1984) J. Virol. 51:329; Tratschin et al. (1984) J. Virol. 51:611; Mendelson et al. (1986) J. Virol. 60:823; Trempe et al. (1987) Virol. 161:18) that are expressed from two promoters at map units 5 (p5) and 19 (p19). The larger Rep proteins transcribed from the p5 promoter (Rep78 and Rep68), are essentially identical except for unique carboxy termini generated from unspliced (Rep78) and spliced (Rep68) transcripts, respectively (Srivastava et al, (1983) J. Virol. 45:555). The two smaller Rep proteins, Rep52 and Rep40, are transcribed from the p19 promoter and represent amino terminal truncations of Rep78 and Rep68, respectively.

Several biochemical activities of Rep78 and Rep68 have been characterized as involved in AAV replication. These include specific binding to the AAV ITR (Ashktorab et al. (1989) J. Virol. 63:3034; Im et al. (1989) J. Virol. 63:3095; Snyder et al. (1993) J. Virol. 67:6096) and site-specific endonuclease cleavage at the terminal resolution site (trs) (Im et al. (1990) J. Virol. 63:447; Im et al. (1992) J. Virol. 66:1119; Snyder et al., (1990) Cell 60:105; Snyder et al. (1990) J. Virol. 64:6204). Rep78/68 also possess ATP dependent DNA-DNA helicase (Im et al., (1990) J. Virol. 63:447; Im et al. (1992) J. Virol. 66:1119) and DNA-RNA helicase as well as ATPase activities (Wonderling et al. (1995) J. Virol. 69:3542). In addition to these activities involved in replication, Rep78/68 also regulate transcription from the viral promoters (Beaton et al. (1989) J. Virol. 63:4450; Labow et al. (1986) J. Virol. 60:251; Tratschin et al. (1986) Mol. Cell. Biol. 6:2884; Kyostio et al. (1994) J. Virol. 68:2947; Pereira et al. (1997) J. Virol. 71:1079), and have been shown to mediate viral targeted integration (Xiao, W., (1996), “Characterization of cis and trans elements essential for the targeted integration of recombinant adeno-associated virus plasmid vectors”, Ph.D. Dissertation, University of North Carolina-Chapel Hill; Balague et al. (1997) J. Virol. 71:3299; LaMartina et al. (1998) J. Virol. 72:7653; Pieroni et al. (1998) Virol. 249:249).

Like Rep proteins, the AAV ITRs are involved in nearly every aspect of the viral life-cycle. The secondary structure of the ITR is necessary to prime synthesis of the second strand to allow transcription of the viral genes (Hauswirth and Berns (1977) J. Virol. 78:488). The full length Rep proteins contain a unique N-terminal DNA binding region which specifically recognizes the ITR at the 16 nt Rep-binding element (RBE) and at the tip of one of the hairpin stems known as the RBE′ (FIG. 1A) (Ryan et al. (1996) J. Virol. 70:1542; Brister and Muzyczka (2000) J. Virol. 74:7762). Rep molecules multimerize on the ITR allowing the C-terminus of Rep, acting as an ATP-dependent SF3 helicase, to unwind the ITR and form a putative internal hairpin (Im and Muzyczka (1990) Cell 61:447; Hermonat and Batchu (1997) FEBS Lett. 20:180). This hairpin, (here, termed ‘nicking stem’) contains the terminal resolution site (trs) where Rep nicks the ITR in a site-specific manner (Brister and Muzyczka (1999) J. Virol. 73:9325). This DNA cleavage is important for replication of the closed ITR and to initiate subsequent rounds of genomic replication. Replicated genomes can undergo replication again or be encapsidated in the presence of the smaller Rep proteins (King et al. (2001) EMBO J. 20:3282).

The ITR sequences of twelve human/primate AAV serotypes have been published. These sequences typically display 80% or greater nucleotide conservation and segregate into two groups (Hewitt et al. (2009) J. Virol. 83:3919). The AAV2 Rep proteins (Rep2) are able to function on the ITR of every known AAV serotype except those of AAV5 (ITR5; Hewitt et al. (2009) J. Virol. 83:3919; Grimm et al. (2006) J. Virol. 80:426). Consistently, the AAV5 Rep proteins (Rep5) are unable to catalyze replication of the ITR of AAV2 (ITR2). Replicative specificity between these serotypes does not exist at the level of binding, as Rep2 and Rep5 can bind interchangeably to ITR2 or ITR5 (Chiorini et al. (1999) J. Virol. 73:4293). Instead, specificity is created by the inability of Rep to cleave the ITR of the opposite serotype. This occurs despite high conservation between the ITR2 and ITR5 sequence, secondary structure, and location of elements required for Rep interaction (RBE, RBE′, trs, nicking stem).

All current AAV vectors in clinical trials utilize ITR2s. However, using ITR2s for therapeutic purposes creates a safety risk due to the ubiquity of AAV2 in the human population as well as other AAVs whose Rep proteins can replicate ITR2s. In this manner, rAAV vectors have the potential to be “mobilized” out of the target tissue into different tissues of the body or into other individuals in the population (Hewitt et al. (2009) J. Virol. 83:3919).

The present invention provides a solution to vector mobilization through the creation of a novel Rep-ITR interaction. A vector utilizing this novel interaction cannot be mobilized by one or more of the wild-type AAV serotypes which infect humans, nor the non-human serotypes which can potentially infect human hosts.

SUMMARY OF THE INVENTION

The present invention relates to the discovery of unique mechanisms at the DNA and protein level to achieve Rep-ITR specificity and utilizes these factors to create novel AAV origins of replication. Thus, one aspect of the invention relates to a polynucleotide comprising at least one parvovirus inverted terminal repeat (ITR), wherein said ITR comprises: (a) a first structural element that functionally interacts with a large Rep protein from a first AAV but does not functionally interact with a large Rep protein from a second AAV; and (b) a second structural element that that functionally interacts with the large Rep protein from the second AAV but does not functionally interact with the large Rep protein from the first AAV; wherein the ITR functionally interacts with a synthetic AAV large Rep protein. The invention further relates to a viral vector and a recombinant parvovirus particle comprising the polynucleotide of the invention. Further provided are pharmaceutical formulations comprising a virus particle of the invention in a pharmaceutically acceptable carrier.

Another aspect of the invention relates to a synthetic large Rep protein comprising a first portion that functionally interacts with a first structural element of a parvovirus ITR and a second portion that functionally interacts with a second structural element of a parvovirus ITR, wherein said first structural element functionally interacts with a large Rep protein from a first AAV and said second structural element functionally interacts with a large Rep protein from a second AAV that is different from the first AAV. The invention further relates to polynucleotides encoding the synthetic large Rep protein and vectors and cells comprising the polynucleotide.

An additional aspect of the invention relates to a method of producing a recombinant parvovirus particle, comprising providing to a cell permissive for parvovirus replication: (a) a recombinant parvovirus template comprising (i) a heterologous nucleotide sequence, and (ii) the parvovirus terminal repeat sequence of the invention; (b) a polynucleotide encoding a Rep protein of the invention; under conditions sufficient for the replication and packaging of the recombinant parvovirus template; whereby recombinant parvovirus particles comprising the parvovirus capsid encoded by the cap coding sequences and packaging the recombinant parvovirus template are produced in the cell.

A further aspect of the invention relates to a method of delivering a nucleic acid to a cell, comprising introducing into a cell the recombinant parvovirus particle of the invention.

Another aspect of the invention relates to a method of administering a nucleic acid to a mammalian subject comprising administering to the mammalian subject a cell that has been contacted with the recombinant parvovirus particle of the invention under conditions sufficient for the parvovirus particle vector genome to enter the cell.

A further aspect of the invention relates to a method of administering a nucleic acid to a mammalian subject comprising administering to the mammalian subject the recombinant parvovirus particle of the invention.

An additional aspect of the invention relates to a parvovirus template comprising (i) a heterologous nucleotide sequence, and (ii) at least one snake AAV ITR sequence and a parvovirus particle comprising the parvovirus template.

A further aspect of the invention relates to a method of producing a parvovirus particle, comprising providing to a cell permissive for parvovirus replication: (a) a recombinant parvovirus template comprising (i) a heterologous nucleotide sequence, and (ii) at least one snake AAV ITR sequence; (b) a polynucleotide encoding snake AAV Rep protein and mammalian AAV Cap protein; (c) a polynucleotide encoding mammalian Rep52 and/or Rep40 proteins; under conditions sufficient for the replication and packaging of the recombinant parvovirus template; whereby recombinant parvovirus particles comprising a parvovirus capsid encoded by the cap coding sequences and packaging the recombinant parvovirus template are produced in the cell.

Another aspect of the invention relates to use of the recombinant parvovirus particle of the invention for delivering a nucleic acid to a cell.

An additional aspect of the invention relates to use of a cell that has been contacted with the recombinant parvovirus particle of the invention for delivering a nucleic acid to a mammalian subject.

A further aspect of the invention relates to use of the recombinant parvovirus particle of the invention for delivering a nucleic acid to a mammalian subject.

Another aspect of the invention relates to use of the recombinant parvovirus particle of the invention for the manufacture of a medicament for delivering a nucleic acid to a cell.

An additional aspect of the invention relates to use of a cell that has been contacted with the recombinant parvovirus particle of the invention for the manufacture of a medicament for delivering a nucleic acid to a mammalian subject.

A further aspect of the invention relates to use of the recombinant parvovirus particle of the invention for the manufacture of a medicament for delivering a nucleic acid to a mammalian subject.

Moreover, the present invention also contemplates that in some embodiments of the invention, any feature or combination of features set forth herein can be excluded or omitted.

To illustrate further, if, for example, the specification indicates that a particular amino acid can be selected from A, G, I, L and/or V, this language also indicates that the amino acid can be selected from any subset of these amino acid(s) for example A, G, I or L; A, G, I or V; A or G; only L; etc. as if each such subcombination is expressly set forth herein. Moreover, such language also indicates that one or more of the specified amino acids can be disclaimed. For example, in particular embodiments the amino acid is not A, G or I; is not A; is not G or V; etc. as if each such possible disclaimer is expressly set forth herein.

These and other aspects of the invention are set forth in more detail in the description of the invention below.

BRIEF DESCRIPTION OF THE DRAWINGS

FIGS. 1A-1B show the cloning and characterization of chimeric ITRs. (FIG. 1A) Sequence and structure of ITR2 (SEQ ID NO:17) (black) and ITR5 (SEQ ID NO:18) (italic) shown with incorporation of SfiI sites for cloning (bold). Length in nt of ITR elements indicated above brackets. RBE is boxed. RBE′ is indicated by a hatched circle. Nicking stem is extruded with arrow indicating the nicking site and hatched box indicating the trs. The four initial chimeric ITRs generated (SEQ ID NOS:19-22) are shown (right). (FIG. 1B) Replication assay and quantitation of chimeric Reps. Replication products from the indicated ITR and either Rep2 or Rep5 were analyzed by Southern blot. Monomeric (m) and dimeric (d) replicating species are indicated. The level of replication of each sample was measured by densitometric analysis and compared to wt replication.

FIGS. 2A-2D show the relation of nicking stem height and sequence to Rep-ITR specificity. (FIG. 2A) Sequence of nicking stem in an otherwise ITR2 context (SEQ ID NOS:17, 18, 23, 25, 30, 32, 28). Arrow indicates trs site. Brackets indicate height of putative stems in nt from the base of the stem to the putative nicking site. Predicted ΔG values for the hairpins are below. Southern blot analysis of the ITRs replicated by Rep2 or Rep5 are shown below. (FIG. 2B) Quantitation of the Southern blots relative to wt replication from FIG. 2A. (FIG. 2C) Same as FIG. 2A, except nicking stems indicated were used in an ITR5 context (SEQ ID NOS:17, 18, 24, 26, 35). (FIG. 2D) Quantitation of the Southern blots relative to wt replication from FIG. 2C.

FIGS. 3A-3D show the effect of RBE-nicking stem spacing on Rep-ITR specificity. (FIG. 3A) ITR2 mutants were synthesized with the indicated spacing between the RBE and nicking stem (SEQ ID NOS:17, 31, 33). (FIG. 3B) Southern blot analysis of the ITRs depicted in FIG. 3A replicated by either Rep2 or Rep5 (Left). Quantitation of Southern blots relative to wt replication (Right). (FIG. 3C) ITR5 mutants synthesized as in FIG. 3A (SEQ ID NOS:34, 18, 37, 38). (FIG. 3D) Southern blot analysis and quantitation of FIG. 3C.

FIGS. 4A-4D demonstrate that the ITR5 spacer acts as a RBE for Rep5. (FIG. 4A) ITR5 mutants were synthesized with the indicated RBE and spacer sequence (SEQ ID NOS:18, 40, 39, 42). Brackets indicate individual tetranucleotide repeats bound by Rep monomers. Both strands of the wt ITR5 sequence are shown to illustrate conservation with the GAGY motif (indicated by *). Only one strand shown on others. (FIG. 4B) Southern blot analysis of the ITRs depicted in FIG. 4A replicated by either Rep2 or Rep5 (Left). Quantitation of Southern blots relative to wt replication (Right). (FIG. 4C) ITR2 mutants were generated with the RBE and spacer sequences indicated (SEQ ID NOS:17, 29, 41, 43). (FIG. 4D) Southern blot analysis and quantitation for FIG. 4C.

FIGS. 5A-5E show the cloning and characterization of chimeric Reps. (FIG. 5A) An alignment of the N-termini of Rep2 (SEQ ID NO:114) and Rep5 (SEQ ID NO:118). (*) represents conserved amino acids. (:) and (.) indicate conservative substitutions. (^) indicates residues implicated in RBE binding interactions. (′) indicates residues which participate in the endonucleolytic active site. (+) indicates residues implicated in RBE′ binding. (FIG. 5B) Chimeric Reps created and their ability to replicate ITR2 or ITR5 flanked vectors. Numbers indicate the amino acid (aa) position of the switch from one Rep to the other. (+) indicates the presence of replication, (−) indicates the absence. (FIG. 5C) Western blot for expression of the chimeric Reps. (FIG. 5D) Southern blot demonstrating replication of an ITR2 or an ITR5 vector by the chimeric Reps. Note that the ITR5 vector is 500 bp larger than the ITR2 vector. (FIG. 5E) Level of replication of the chimeric Reps relative to wt Rep2 or Rep5.

FIGS. 6A-6G show the characterization of Rep regions involved in ITR specificity. (FIG. 6A) Chimeric Reps and their ability to replicate ITR2 or ITR5 flanked vectors. Numbers indicate the aa position of the switch from one Rep to the other. (+) indicates the presence of replication, (−) indicates the absence. Region 1 and 2 involved in Rep-ITR specificity are indicated. (FIG. 6B) Western blot for expression of chimeric Reps. (FIG. 6C) Southern blot demonstrating replication of an ITR2 or ITR5 vector by the chimeric Reps. Note that the ITR5 vector is 500 bp larger than the ITR2 vector. (FIG. 6D) Structural model illustrating the two Rep regions. The nucleophilic tyrosine is indicated. Black hatched circle indicates the predicted structural difference of region 1 in the major groove of the ITR. (FIG. 6E) Structural model as in FIG. 6D. The nucleophilic tyrosine is indicated. (FIG. 6F) Detailed structural view of region 1. The side-chains of non-conserved residues from Rep5 and Rep2 are shown. Three Rep5 residues implicated in RBE′ binding are indicated. (FIG. 6G) Detailed structural view of region 2. Side chains of active site residues are shown in black. Side chains of non-conserved residues in this region are shown for Rep2 and Rep5. The nucleophilic tyrosine is indicated, as is the adjacent Rep2 Asn-155.

FIGS. 7A-7C show a model of Rep-ITR specificity. (FIG. 7A) Southern blot of Hirt DNA demonstrating replication of the indicated ITR vector by the indicated Rep. (FIG. 7B) Table indicating the presence (+) or absence (−) of replication of the gel from FIG. 7A. (FIG. 7C) Model of a novel AAV origin of replication. The chimeric ITR can be replicated only by a chimeric Rep protein. Rep5 sequence in region 1 is required for the extended RBE of ITR5. Rep2 sequence in region 2 is required to function on an ITR2 nicking stem.

FIG. 8 shows an illustrative genomic DNA sequence for AAV-1; GenBank Accession No. NC 002077; SEQ ID NO:1.

FIG. 9 shows an illustrative genomic DNA sequence for AAV-2; GenBank Accession No. NC 001401; SEQ ID NO:2.

FIG. 10 shows an illustrative genomic DNA sequence for AAV-3A; GenBank Accession No. NC 001729; SEQ ID NO:3.

FIG. 11 shows an illustrative genomic DNA sequence for AAV-3B; GenBank Accession No. NC 001863; SEQ ID NO:4.

FIG. 12 shows an illustrative genomic DNA sequence for AAV-4; GenBank Accession No. NC 001829; SEQ ID NO:5.

FIG. 13 shows an illustrative genomic DNA sequence for AAV-5; GenBank Accession No. NC 006152; SEQ ID NO:6.

FIG. 14 shows an illustrative genomic DNA sequence for AAV-6; GenBank Accession No. NC 001862; SEQ ID NO:7.

FIG. 15 shows an illustrative genomic DNA sequence for AAV-7; GenBank Accession No. AF513851; SEQ ID NO:8.

FIG. 16 shows an illustrative genomic DNA sequence for AAV-8; GenBank Accession No. AF513852; SEQ ID NO:9.

FIG. 17 shows an illustrative genomic DNA sequence for AAV-9; GenBank Accession No. AX753250; SEQ ID NO:10.

FIG. 18 shows an illustrative genomic DNA sequence for AAV-11; GenBank Accession No. AY631966; SEQ ID NO:11.

FIG. 19 shows an illustrative genomic DNA sequence for AAV-13; GenBank Accession No. EU285562; SEQ ID NO:12.

FIG. 20 shows an illustrative genomic DNA sequence for B19 parvovirus; GenBank Accession No. NC 000883; SEQ ID NO:13.

FIG. 21 shows an illustrative genomic DNA sequence for Minute Virus from Mouse (MVM); GenBank Accession No. NC 001510; SEQ ID NO:14.

FIG. 22 shows an illustrative genomic DNA sequence for goose parvovirus; GenBank Accession No. NC 001701; SEQ ID NO:15.

FIG. 23 shows an illustrative genomic DNA sequence for snake parvovirus 1; GenBank Accession No. NC 006148; SEQ ID NO:16.

FIG. 24 provides an exemplary listing of the chimeric ITRs that were synthesized as part of the Examples described below: ITR2 (SEQ ID NO:17), ITR5 (SEQ ID NO:18), ITR5+2SNS (SEQ ID NO:19), ITR2+5SNS (SEQ ID NO:20), ITR5+2NS (SEQ ID NO:21), ITR2+5NS (SEQ ID NO:22), ITR2−TA (SEQ ID NO:23), ITR5+TA (SEQ ID NO:24), ITR2−GC (SEQ ID NO:25), ITR5+GC (SEQ ID NO:26), ITR2−2 nt (SEQ ID NO:27), ITR2 5 nt (SEQ ID NO:28), ITR2+7 (SEQ ID NO:29), ITR2 9 nt (SEQ ID NO:30), ITR2 10 nt (SEQ ID NO:31), ITR2 11 nt (SEQ ID NO:32), ITR2 15 nt (SEQ ID NO:33), ITR5 3 nt (SEQ ID NO:34), ITR5 6 nt (SEQ ID NO:35), ITR5 9 bp NS (SEQ ID NO:36), ITR5 21 nt (SEQ ID NO:37), ITR5 30 nt (SEQ ID NO:38), ITR5 GAGY (SEQ ID NO:39), ITR5 no GAGY (SEQ ID NO:40), ITR2+8 nt GAGY (SEQ ID NO:41), ITR5 Spacer RBE (SEQ ID NO:42), ITR2+8−8 Spacer RBE (SEQ ID NO:43), ITR5 with ITR2 hairpins (SEQ ID NO:44), ITR2 no hairpins (SEQ ID NO:45), ITR2 T1 (SEQ ID NO:46), ITR2 T2 (SEQ ID NO:47), ITR2 T2 #2 (SEQ ID NO:48), ITR2 T3 (SEQ ID NO:49), ITR2 T4 (SEQ ID NO:50), ITR5+3 nt Spacer & ITR5 NS (SEQ ID NO:51), and ITR2 pHpa8 (SEQ ID NO:52).

FIG. 25 provides an exemplary listing of the chimeric Rep proteins that were synthesized as part of the Examples described below: Rep52aa73 (SEQ ID NO:53), Rep52aa84 (SEQ ID NO:54), Rep52aa110 (SEQ ID NO:55), Rep52aa126 (SEQ ID NO:56), Rep52aa138 (SEQ ID NO:57), Rep52aa160 (SEQ ID NO:58), Rep52aa175 (SEQ ID NO:59), Rep52aa187 (SEQ ID NO:60), Rep52aa207 (SEQ ID NO:61), Rep25aa73 (SEQ ID NO:62), Rep25aa77 (SEQ ID NO:63), Rep25aa97 (SEQ ID NO:64), Rep25aa116 (SEQ ID NO:65), Rep25aa125 (SEQ ID NO:66), Rep25aa141 (SEQ ID NO:67), Rep25aa149 (SEQ ID NO:68), Rep25aa166 (SEQ ID NO:69), Rep25aa187 (SEQ ID NO:70), Rep25aa216 (SEQ ID NO:71), Rep525aa110-148 (SEQ ID NO:72), Rep525aa146-187 (SEQ ID NO:73), Rep525aa110-187 (SEQ ID NO:74), Rep252aa97-146 (SEQ ID NO:75), Rep252aa149-187 (SEQ ID NO:76), and Rep252aa97-187 (SEQ ID NO:77).

FIG. 26 shows both the nucleotide and amino acid sequences of a chimeric Rep protein of the invention: Rep52aa146 (SEQ ID NO:78 and SEQ ID NO:79, respectively).

FIG. 27 shows both the nucleotide and amino acid sequences of a chimeric Rep protein of the invention: Rep52aa147 (SEQ ID NO:80 and SEQ ID NO:81, respectively).

FIG. 28 shows both the nucleotide and amino acid sequences of a chimeric Rep protein of the invention: Rep52aa151 (SEQ ID NO:82 and SEQ ID NO:83, respectively).

FIG. 29 shows an alignment of the amino acid sequences of exemplary Rep40 proteins from AAV1 (SEQ ID NO:84), AAV2 (SEQ ID NO:85), AAV3A (SEQ ID NO:86), AAV3B (SEQ ID NO:87), AAV4 (SEQ ID NO:88), AAV5 (SEQ ID NO:89), AAV6 (SEQ ID NO:90), AAV7 (SEQ ID NO:91) and AAV8 (SEQ ID NO:92), as well as a consensus sequence (SEQ ID NO:93). Dashes indicate gaps in the sequence and shading indicates positions of sequence homology.

FIG. 30 shows an alignment of the amino acid sequences of exemplary Rep52 proteins from AAV1 (SEQ ID NO:94), AAV2 (SEQ ID NO:95), AAV3A (SEQ ID NO:96), AAV3B (SEQ ID NO:97), AAV4 (SEQ ID NO:98), AAV5 (SEQ ID NO:99), AAV6 (SEQ ID NO:100), AAV7 (SEQ ID NO:101) and AAV8 (SEQ ID NO:102), as well as a consensus sequence (SEQ ID NO:103). Dashes indicate gaps in the sequence and shading indicates positions of sequence homology.

FIG. 31 shows an alignment of the amino acid sequences of exemplary Rep68 proteins from AAV1 (SEQ ID NO:104), AAV2 (SEQ ID NO:105), AAV3A (SEQ ID NO:106), AAV3B (SEQ ID NO:107), AAV4 (SEQ ID NO:108), AAV5 (SEQ ID NO:109), AAV6 (SEQ ID NO:110), AAV7 (SEQ ID NO:111) and AAV8 (SEQ ID NO:112). Dashes indicate gaps in the sequence and shading indicates positions of sequence homology.

FIG. 32 shows an alignment of the amino acid sequences of exemplary Rep78 proteins from AAV1 (SEQ ID NO:113), AAV2 (SEQ ID NO:114), AAV3A (SEQ ID NO:115), AAV3B (SEQ ID NO:116), AAV4 (SEQ ID NO:117), AAV5 (SEQ ID NO:118), AAV6 (SEQ ID NO:119), AAV7 (SEQ ID NO:120) and AAV8 (SEQ ID NO:121), as well as a consensus sequence (SEQ ID NO:122). Dashes indicate gaps in the sequence and shading indicates positions of sequence homology.

FIG. 33 shows the nucleotide sequence of the snake ITR utilized in Example 9 (SEQ ID NO:123).

FIG. 34 shows the nucleotide sequence of the snake ITR eGFP vector plasmid (SEQ ID NO:124) used to synthesize the snake vector described in Example 9.

FIG. 35 shows the nucleotide sequence of the pSnRepCap2 plasmid (SEQ ID NO:125) used to synthesize the snake vector described in Example 9.

FIGS. 36A-36C shows a diagram of ITR synthesis. (FIG. 36A) The ITR was synthesized in two pieces (

and

) overlapping across one hairpin stem holding the SfiI site (

). (FIG. 36B) Each half was amplified via PCR prior to digestion and cloning. (FIG. 36C) Proper triple-ligation with pUC18-CMV GFP produced an ITR in DD format.

DETAILED DESCRIPTION OF THE INVENTION

The present invention will now be described with reference to the accompanying drawings, in which preferred embodiments of the invention are shown. This invention may, however, be embodied in different forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the invention to those skilled in the art.

Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. The terminology used in the description of the invention herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. All publications, patent applications, patents, and other references mentioned herein are incorporated by reference in their entirety.

Nucleotide sequences are presented herein by single strand only, in the 5′ to 3′ direction, from left to right, unless specifically indicated otherwise. Nucleotides and amino acids are represented herein in the manner recommended by the IUPAC-IUB Biochemical Nomenclature Commission, or (for amino acids) by either the one-letter code, or the three letter code, both in accordance with 37 CFR § 1.822 and established usage. See, e.g., PatentIn User Manual, 99-102 (November 1990) (U.S. Patent and Trademark Office).

Except as otherwise indicated, standard methods known to those skilled in the art may be used for the construction of recombinant parvovirus and rAAV constructs, packaging vectors expressing the parvovirus Rep and/or Cap sequences, and transiently and stably transfected packaging cells. Such techniques are known to those skilled in the art. See, e.g., SAMBROOK et al. MOLECULAR CLONING: A LABORATORY MANUAL 2nd Ed. (Cold Spring Harbor, N.Y., 1989); AUSUBEL et al. CURRENT PROTOCOLS IN MOLECULAR BIOLOGY(Green Publishing Associates, Inc. and John Wiley & Sons, Inc., New York).

Definitions

The following terms are used in the description herein and the appended claims:

The singular forms “a” and “an” are intended to include the plural forms as well, unless the context clearly indicates otherwise.

Furthermore, the term “about,” as used herein when referring to a measurable value such as an amount of the length of a polynucleotide or polypeptide sequence, dose, time, temperature, and the like, is meant to encompass variations of 20%, 10%, 5%, 1%, 0.5%, or even 0.1% of the specified amount.

Also as used herein, “and/or” refers to and encompasses any and all possible combinations of one or more of the associated listed items, as well as the lack of combinations when interpreted in the alternative (“or”).

As used herein, the transitional phrase “consisting essentially of” is to be interpreted as encompassing the recited materials or steps “and those that do not materially affect the basic and novel characteristic(s)” of the claimed invention (e.g., rAAV replication). See, In re Herz, 537 F.2d 549, 551-52, 190 U.S.P.Q. 461, 463 (CCPA 1976) (emphasis in the original); see also MPEP § 2111.03. Thus, the term “consisting essentially of” as used herein should not be interpreted as equivalent to “comprising.”

The term “parvovirus” as used herein encompasses the family Parvoviridae, including autonomously-replicating parvoviruses and dependoviruses. The autonomous parvoviruses include members of the genera Parvovirus, Erythrovirus, Densovirus, Iteravirus, and Contravirus. Exemplary autonomous parvoviruses include, but are not limited to, minute virus of mouse, bovine parvovirus, canine parvovirus, chicken parvovirus, feline panleukopenia virus, feline parvovirus, goose parvovirus, H1 parvovirus, muscovy duck parvovirus, snake parvovirus, and B19 virus (See, e.g., FIGS. 20-23). Other autonomous parvoviruses are known to those skilled in the art. See, e.g., FIELDS et al. VIROLOGY, volume 2, chapter 69 (4th ed., Lippincott-Raven Publishers).

The genus Dependovirus contains the adeno-associated viruses (AAV), including but not limited to, AAV type 1, AAV type 2, AAV type 3 (including types 3A and 3B), AAV type 4, AAV type 5, AAV type 6, AAV type 7, AAV type 8, AAV type 9, AAV type 10, AAV type 11, AAV type 12, AAV type 13, avian AAV, bovine AAV, canine AAV, goat AAV, snake AAV, equine AAV, and ovine AAV. See, e.g., FIGS. 8-19; FIELDS et al. VIROLOGY, volume 2, chapter 69 (4th ed., Lippincott-Raven Publishers); and Table 1.

TABLE 1 GenBank Accession Number Complete Genomes Adeno-associated virus 1 NC_002077, AF063497 Adeno-associated virus 2 NC_001401 Adeno-associated virus 3 NC_001729 Adeno-associated virus 3B NC_001863 Adeno-associated virus 4 NC_001829 Adeno-associated virus 5 Y18065, AF085716 Adeno-associated virus 6 NC_001862 Avian AAV ATCC VR-865 AY186198, AY629583, NC_004828 Avian AAV strain DA-1 NC_006263, AY629583 Bovine AAV NC_005889, AY388617 Clade A AAV1 NC_002077, AF063497 AAV6 NC_001862 Hu.48 AY530611 Hu 43 AY530606 Hu 44 AY530607 Hu 46 AY530609 Clade B Hu. 19 AY530584 Hu. 20 AY530586 Hu 23 AY530589 Hu22 AY530588 Hu24 AY530590 Hu21 AY530587 Hu27 AY530592 Hu28 AY530593 Hu 29 AY530594 Hu63 AY530624 Hu64 AY530625 Hu13 AY530578 Hu56 AY530618 Hu57 AY530619 Hu49 AY530612 Hu58 AY530620 Hu34 AY530598 Hu35 AY530599 AAV2 NC_001401 Hu45 AY530608 Hu47 AY530610 Hu51 AY530613 Hu52 AY530614 Hu T41 AY695378 Hu S17 AY695376 Hu T88 AY695375 Hu T71 AY695374 Hu T70 AY695373 Hu T40 AY695372 Hu T32 AY695371 Hu T17 AY695370 Hu LG15 AY695377 Clade C Hu9 AY530629 Hu10 AY530576 Hu11 AY530577 Hu53 AY530615 Hu55 AY530617 Hu54 AY530616 Hu7 AY530628 Hu18 AY530583 Hu15 AY530580 Hu16 AY530581 Hu25 AY530591 Hu60 AY530622 Ch5 AY243021 Hu3 AY530595 Hu1 AY530575 Hu4 AY530602 Hu2 AY530585 Hu61 AY530623 Clade D Rh62 AY530573 Rh48 AY530561 Rh54 AY530567 Rh55 AY530568 Cy2 AY243020 AAV7 AF513851 Rh35 AY243000 Rh37 AY242998 Rh36 AY242999 Cy6 AY243016 Cy4 AY243018 Cy3 AY243019 Cy5 AY243017 Rh13 AY243013 Clade E Rh38 AY530558 Hu66 AY530626 Hu42 AY530605 Hu67 AY530627 Hu40 AY530603 Hu41 AY530604 Hu37 AY530600 Rh40 AY530559 Rh2 AY243007 Bb1 AY243023 Bb2 AY243022 Rh10 AY243015 Hu17 AY530582 Hu6 AY530621 Rh25 AY530557 Pi2 AY530554 Pi1 AY530553 Pi3 AY530555 Rh57 AY530569 Rh50 AY530563 Rh49 AY530562 Hu39 AY530601 Rh58 AY530570 Rh61 AY530572 Rh52 AY530565 Rh53 AY530566 Rh51 AY530564 Rh64 AY530574 Rh43 AY530560 AAV8 AF513852 Rh8 AY242997 Rh1 AY530556 Clade F Hu14 (AAV9) AY530579 Hu31 AY530596 Hu32 AY530597 Clonal Isolate AAV5 Y18065, AF085716 AAV3 NC_001729 AAV3B NC_001863 AAV4 NC_001829 Rh34 AY243001 Rh33 AY243002 Rh32 AY243003

As used herein, the term “adeno-associated virus” (AAV), includes but is not limited to, AAV type 1, AAV type 2, AAV type 3 (including types 3A and 3B), AAV type 4, AAV type 5, AAV type 6, AAV type 7, AAV type 8, AAV type 9, AAV type 10, AAV type 11, AAV type 12, AAV type 13, snake AAV, avian AAV, bovine AAV, canine AAV, equine AAV, ovine AAV, goat AAV, shrimp AAV, and any other AAV now known or later discovered. See, e.g., FIELDS et al. VIROLOGY, volume 2, chapter 69 (4th ed., Lippincott-Raven Publishers). A number of relatively new AAV serotypes and clades have been identified (See, e.g., Gao et al. (2004) J. Virol. 78:6381; Moris et al. (2004) Virol. 33-:375; and Table 1).

The parvovirus particles and genomes of the present invention can be from, but are not limited to, AAV. The genomic sequences of various serotypes of AAV and the autonomous parvoviruses, as well as the sequences of the native ITRs, Rep proteins, and capsid subunits are known in the art. Such sequences may be found in the literature or in public databases such as GenBank. See, e.g., FIGS. 8-23; GenBank Accession Numbers NC_002077, NC_001401, NC_001729, NC_001863, NC_001829, NC_001862, NC_000883, NC_001701, NC_001510, NC_006152, NC_006261, AF063497, U89790, AF043303, AF028705, AF028704, J02275, J01901, J02275, X01457, AF288061, AH009962, AY028226, AY028223, AY631966, AX753250, EU285562, NC_001358, NC_001540, AF513851, AF513852 and AY530579; the disclosures of which are incorporated by reference herein for teaching parvovirus and AAV nucleic acid and amino acid sequences. See also, e.g., Bantel-Schaal et al. (1999) J. Virol. 73: 939; Chiorini et al. (1997) J. Virol. 71:6823; Chiorini et al. (1999) J. Virol. 73:1309; Gao et al. (2002) Proc. Nat. Acad. Sci. USA 99:11854; Moris et al. (2004) Virol. 33-:375-383; Mori et al. (2004) Virol. 330:375; Muramatsu et al. (1996) Virol. 221:208; Ruffing et al. (1994) J. Gen. Virol. 75:3385; Rutledge et al. (1998) J. Virol. 72:309; Schmidt et al. (2008) J. Virol. 82:8911; Shade et al., (1986) J. Virol. 58:921; Srivastava et al. (1983) J. Virol. 45:555; Xiao et al. (1999) J. Virol. 73:3994; international patent publications WO 00/28061, WO 99/61601, WO 98/11244; and U.S. Pat. No. 6,156,303; the disclosures of which are incorporated by reference herein for teaching parvovirus and AAV nucleic acid and amino acid sequences. See also Table 1. An early description of the AAV1, AAV2 and AAV3 ITR sequences is provided by Xiao, X., (1996), “Characterization of Adeno-associated virus (AAV) DNA replication and integration,” Ph.D. Dissertation, University of Pittsburgh, Pittsburgh, Pa. (incorporated herein it its entirety).

The term “tropism” as used herein refers to entry of the virus into the cell, optionally and preferably followed by expression (e.g., transcription and, optionally, translation) of sequences carried by the viral genome in the cell, e.g., for a recombinant virus, expression of the heterologous nucleotide sequences(s). Those skilled in the art will appreciate that transcription of a heterologous nucleic acid sequence from the viral genome may not be initiated in the absence of trans-acting factors, e.g., for an inducible promoter or otherwise regulated nucleic acid sequence. In the case of AAV, gene expression from the viral genome may be from a stably integrated provirus, from a non-integrated episome, as well as any other form in which the virus may take within the cell.

As used herein, “transduction” or “infection” of a cell by a parvovirus or AAV means that the parvovirus/AAV enters the cell to establish an active (i.e., lytic) infection. As used herein, “transduction” of a cell by AAV means that the AAV enters the cell to establish a latent infection. See, e.g., FIELDS et al. VIROLOGY, volume 2, chapter 69 (3d ed., Lippincott-Raven Publishers).

The terms “5′ portion” and “3′ portion” are relative terms to define a spatial relationship between two or more elements. Thus, for example, a “3′ portion” of a polynucleotide indicates a segment of the polynucleotide that is downstream of another segment. The term “3′ portion” is not intended to indicate that the segment is necessarily at the 3′ end of the polynucleotide, or even that it is necessarily in the 3′ half of the polynucleotide, although it may be. Likewise, a “5′ portion” of a polynucleotide indicates a segment of the polynucleotide that is upstream of another segment. The term “5′ portion” is not intended to indicate that the segment is necessarily at the 5′ end of the polynucleotide, or even that it is necessarily in the 5′ half of the polynucleotide, although it may be.

As used herein, the term “polypeptide” encompasses both peptides and proteins, unless indicated otherwise.

A “polynucleotide” is a sequence of nucleotide bases, and may be RNA, DNA or DNA-RNA hybrid sequences (including both naturally occurring and non-naturally occurring nucleotide), and can be either single or double stranded DNA sequences.

The term “sequence identity,” as used herein, has the standard meaning in the art. As is known in the art, a number of different programs can be used to identify whether a polynucleotide or polypeptide has sequence identity or similarity to a known sequence. Sequence identity or similarity may be determined using standard techniques known in the art, including, but not limited to, the local sequence identity algorithm of Smith & Waterman, Adv. Appl. Math. 2:482 (1981), by the sequence identity alignment algorithm of Needleman & Wunsch, J. Mol. Biol. 48:443 (1970), by the search for similarity method of Pearson & Lipman, Proc. Natl. Acad. Sci. USA 85:2444 (1988), by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Drive, Madison, Wis.), the Best Fit sequence program described by Devereux et al., Nucl. Acid Res. 12:387 (1984), preferably using the default settings, or by inspection.

An example of a useful algorithm is PILEUP. PILEUP creates a multiple sequence alignment from a group of related sequences using progressive, pairwise alignments. It can also plot a tree showing the clustering relationships used to create the alignment. PILEUP uses a simplification of the progressive alignment method of Feng & Doolittle, J. Mol. Evol. 35:351 (1987); the method is similar to that described by Higgins & Sharp, CABIOS 5:151 (1989).

Another example of a useful algorithm is the BLAST algorithm, described in Altschul et al., J. Mol. Biol. 215:403 (1990) and Karlin et al., Proc. Natl. Acad. Sci. USA 90:5873 (1993). A particularly useful BLAST program is the WU-BLAST-2 program which was obtained from Altschul et al., Meth. Enzymol., 266:460 (1996); blast.wustl/edu/blast/README.html. WU-BLAST-2 uses several search parameters, which are preferably set to the default values. The parameters are dynamic values and are established by the program itself depending upon the composition of the particular sequence and composition of the particular database against which the sequence of interest is being searched; however, the values may be adjusted to increase sensitivity.

An additional useful algorithm is gapped BLAST as reported by Altschul et al., Nucleic Acids Res. 25:3389 (1997).

A percentage amino acid sequence identity value is determined by the number of matching identical residues divided by the total number of residues of the “longer” sequence in the aligned region. The “longer” sequence is the one having the most actual residues in the aligned region (gaps introduced by WU-Blast-2 to maximize the alignment score are ignored).

In a similar manner, percent nucleic acid sequence identity with respect to the coding sequence of the polypeptides disclosed herein is defined as the percentage of nucleotide residues in the candidate sequence that are identical with the nucleotides in the polynucleotide specifically disclosed herein.

The alignment may include the introduction of gaps in the sequences to be aligned. In addition, for sequences which contain either more or fewer amino acids than the polypeptides specifically disclosed herein, it is understood that in one embodiment, the percentage of sequence identity will be determined based on the number of identical amino acids in relation to the total number of amino acids. Thus, for example, sequence identity of sequences shorter than a sequence specifically disclosed herein, will be determined using the number of amino acids in the shorter sequence, in one embodiment. In percent identity calculations relative weight is not assigned to various manifestations of sequence variation, such as insertions, deletions, substitutions, etc.

In one embodiment, only identities are scored positively (+1) and all forms of sequence variation including gaps are assigned a value of “0,” which obviates the need for a weighted scale or parameters as described below for sequence similarity calculations. Percent sequence identity can be calculated, for example, by dividing the number of matching identical residues by the total number of residues of the “shorter” sequence in the aligned region and multiplying by 100. The “longer” sequence is the one having the most actual residues in the aligned region.

As used herein, an “isolated” polynucleotide (e.g., an “isolated DNA” or an “isolated RNA”) means a polynucleotide separated or substantially free from at least some of the other components of the naturally occurring organism or virus, for example, the cell or viral structural components or other polypeptides or nucleic acids commonly found associated with the polynucleotide.

Likewise, an “isolated” polypeptide means a polypeptide that is separated or substantially free from at least some of the other components of the naturally occurring organism or virus, for example, the cell or viral structural components or other polypeptides or nucleic acids commonly found associated with the polypeptide.

A “therapeutic polypeptide” is a polypeptide that may alleviate or reduce symptoms that result from an absence or defect in a protein in a cell or subject. Alternatively, a “therapeutic polypeptide” is one that otherwise confers a benefit to a subject, e.g., anti-cancer effects or improvement in transplant survivability.

As used herein, the term “modified,” as applied to a polynucleotide or polypeptide sequence, refers to a sequence that differs from a wild-type sequence due to one or more deletions, additions, substitutions, or any combination thereof.

As used herein, by “isolate” or “purify” (or grammatical equivalents) a virus vector, it is meant that the virus vector is at least partially separated from at least some of the other components in the starting material.

By the terms “treat,” “treating” or “treatment of” (and grammatical variations thereof) it is meant that the severity of the subject's condition is reduced, at least partially improved or stabilized and/or that some alleviation, mitigation, decrease or stabilization in at least one clinical symptom is achieved and/or there is a delay in the progression of the disease or disorder.

The terms “prevent,” “preventing” and “prevention” (and grammatical variations thereof) refer to prevention and/or delay of the onset of a disease, disorder and/or a clinical symptom(s) in a subject and/or a reduction in the severity of the onset of the disease, disorder and/or clinical symptom(s) relative to what would occur in the absence of the methods of the invention. The prevention can be complete, e.g., the total absence of the disease, disorder and/or clinical symptom(s). The prevention can also be partial, such that the occurrence of the disease, disorder and/or clinical symptom(s) in the subject and/or the severity of onset is less than what would occur in the absence of the present invention.

A “treatment effective” amount as used herein is an amount that is sufficient to provide some improvement or benefit to the subject. Alternatively stated, a “treatment effective” amount is an amount that will provide some alleviation, mitigation, decrease or stabilization in at least one clinical symptom in the subject. Those skilled in the art will appreciate that the therapeutic effects need not be complete or curative, as long as some benefit is provided to the subject.

A “prevention effective” amount as used herein is an amount that is sufficient to prevent and/or delay the onset of a disease, disorder and/or clinical symptoms in a subject and/or to reduce and/or delay the severity of the onset of a disease, disorder and/or clinical symptoms in a subject relative to what would occur in the absence of the methods of the invention. Those skilled in the art will appreciate that the level of prevention need not be complete, as long as some benefit is provided to the subject.

The terms “heterologous nucleotide sequence” and “heterologous nucleic acid” are used interchangeably herein and refer to a sequence that is not naturally occurring in the virus. Generally, the heterologous nucleic acid comprises an open reading frame that encodes a polypeptide or nontranslated RNA of interest (e.g., for delivery to a cell or subject).

As used herein, the terms “virus vector,” “vector” or “gene delivery vector” refer to a virus (e.g., AAV) particle that functions as a nucleic acid delivery vehicle, and which comprises the vector genome (e.g., viral DNA [vDNA]) packaged within a virion. Alternatively, in some contexts, the term “vector” may be used to refer to the vector genome/vDNA alone.

The virus vectors of the invention can further be duplexed parvovirus particles as described in international patent publication WO 01/92551 (the disclosure of which is incorporated herein by reference in its entirety). Thus, in some embodiments, double stranded (duplex) genomes can be packaged into the virus capsids of the invention.

A “rAAV vector genome” or “rAAV genome” is an AAV genome (i.e., vDNA) that comprises one or more heterologous nucleic acid sequences. rAAV vectors generally require only the 145 base ITR in cis to generate virus. All other viral sequences are dispensable and may be supplied in trans (Muzyczka (1992) Curr. Topics Microbiol. Immunol. 158:97). Typically, the rAAV vector genome will only retain the one or more ITR sequence so as to maximize the size of the transgene that can be efficiently packaged by the vector. The structural and non-structural protein coding sequences may be provided in trans (e.g., from a vector, such as a plasmid, or by stably integrating the sequences into a packaging cell). In embodiments of the invention the rAAV vector genome comprises at least one ITR sequence (e.g., AAV ITR sequence), optionally two ITRs (e.g., two AAV ITRs), which typically will be at the 5′ and 3′ ends of the vector genome and flank the heterologous nucleic acid, but need not be contiguous thereto. The ITRs can be the same or different from each other.

The term “terminal repeat” or “TR” includes any viral terminal repeat or synthetic sequence that forms a hairpin structure and functions as an inverted terminal repeat (i.e., mediates the desired functions such as replication, virus packaging, integration and/or provirus rescue, and the like). The ITR can be an AAV ITR or a non-AAV ITR. For example, a non-AAV ITR sequence such as those of other parvoviruses (e.g., canine parvovirus, bovine parvovirus, mouse parvovirus, porcine parvovirus, human parvovirus B-19) or the SV40 hairpin that serves as the origin of SV40 replication can be used as an ITR, which can further be modified by truncation, substitution, deletion, insertion and/or addition. Further, the ITR can be partially or completely synthetic, such as the “double-D sequence” as described in U.S. Pat. No. 5,478,745 to Samulski et al. FIG. 24 provides examples of synthetic ITRs contemplated by the present invention.

Parvovirus genomes have palindromic sequences at both their 5′ and 3′ ends. The palindromic nature of the sequences leads to the formation of a hairpin structure that is stabilized by the formation of hydrogen bonds between the complementary base pairs. This hairpin structure is believed to adopt a “Y” or a “T” shape. See, e.g., FIELDS et al. VIROLOGY, volume 2, chapters 69 & 70 (4th ed., Lippincott-Raven Publishers).

An “AAV inverted terminal repeat” or “AAV ITR” may be from any AAV, including but not limited to serotypes 1, 2, 3a, 3b, 4, 5, 6, 7, 8, 9, 10, 11, or 13, snake AAV, avian AAV, bovine AAV, canine AAV, equine AAV, ovine AAV, goat AAV, shrimp AAV, or any other AAV now known or later discovered (see, e.g., Table 1). An AAV ITR need not have the native terminal repeat sequence (e.g., a native AAV ITR sequence may be altered by insertion, deletion, truncation and/or missense mutations), as long as the terminal repeat mediates the desired functions, e.g., replication, virus packaging, integration, and/or provirus rescue, and the like.

The virus vectors of the invention can further be “targeted” virus vectors (e.g., having a directed tropism) and/or a “hybrid” parvovirus (i.e., in which the viral ITRs and viral capsid are from different parvoviruses) as described in international patent publication WO 00/28004 and Chao et al., (2000) Mol. Therapy 2:619.

Further, the viral capsid or genomic elements can contain other modifications, including insertions, deletions and/or substitutions.

As used herein, the term “amino acid” encompasses any naturally occurring amino acids, modified forms thereof, and synthetic amino acids.

Naturally occurring, levorotatory (L-) amino acids are shown in Table 2.

TABLE 2 Abbreviation Amino Acid Residue Three-Letter Code One-Letter Code Alanine Ala A Arginine Arg R Asparagine Asn N Aspartic acid (Aspartate) Asp D Cysteine Cys C Glutamine Gln Q Glutamic acid (Glutamate) Glu E Glycine Gly G Histidine His H Isoleucine Ile I Leucine Leu L Lysine Lys K Methionine Met M Phenylalanine Phe F Proline Pro P Serine Ser S Threonine Thr T Tryptophan Trp W Tyrosine Tyr Y Valine Val V

Alternatively, the amino acid can be a modified amino acid residue (nonlimiting examples are shown in Table 3) or can be an amino acid that is modified by post-translation modification (e.g., acetylation, amidation, formylation, hydroxylation, methylation, phosphorylation or sulfatation).

TABLE 3 Amino Acid Residue Derivatives Modified Amino Acid Residue Abbreviation 2-Aminoadipic acid Aad 3-Aminoadipic acid bAad beta-Alanine, beta-Aminoproprionic acid bAla 2-Aminobutyric acid Abu 4-Aminobutyric acid, Piperidinic acid 4Abu 6-Aminocaproic acid Acp 2-Aminoheptanoic acid Ahe 2-Aminoisobutyric acid Aib 3-Aminoisobutyric acid bAib 2-Aminopimelic acid Apm t-butylalanine t-BuA Citrulline Cit Cyclohexylalanine Cha 2,4-Diaminobutyric acid Dbu Desmosine Des 2,2′-Diaminopimelic acid Dpm 2,3-Diaminoproprionic acid Dpr N-Ethylglycine EtGly N-Ethylasparagine EtAsn Homoarginine hArg Homocysteine hCys Homoserine hSer Hydroxylysine Hyl Allo-Hydroxylysine aHyl 3-Hydroxyproline 3Hyp 4-Hydroxyproline 4Hyp Isodesmosine Ide allo-Isoleucine aIle Methionine sulfoxide MSO N-Methylglycine, sarcosine MeGly N-Methylisoleucine MeIle 6-N-Methyllysine MeLys N-Methylvaline MeVal 2-Naphthylalanine 2-Nal Norvaline Nva Norleucine Nle Ornithine Orn 4-Chlorophenylalanine Phe(4-Cl) 2-Fluorophenylalanine Phe(2-F) 3-Fluorophenylalanine Phe(3-F) 4-Fluorophenylalanine Phe(4-F) Phenylglycine Phg Beta-2-thienylalanine Thi

Further, the non-naturally occurring amino acid can be an “unnatural” amino acid as described by Wang et al. (2006) Annu. Rev. Biophys. Biomol. Struct. 35:225-49. These unnatural amino acids can advantageously be used to chemically link molecules of interest to the AAV capsid protein.

The term “template” or “substrate” is used herein to refer to a polynucleotide sequence that may be replicated to produce the parvovirus viral DNA. For the purpose of vector production, the template will typically be embedded within a larger nucleotide sequence or construct, including but not limited to a plasmid, naked DNA vector, bacterial artificial chromosome (BAC), yeast artificial chromosome (YAC) or a viral vector (e.g., adenovirus, herpesvirus, Epstein-Barr Virus, AAV, baculoviral, retroviral vectors, and the like). Alternatively, the template may be stably incorporated into the chromosome of a packaging cell.

As used herein, parvovirus or AAV “Rep coding sequences” indicate the nucleic acid sequences that encode the parvoviral or AAV non-structural proteins that mediate viral replication and the production of new virus particles. The parvovirus and AAV replication genes and proteins have been described in e.g., FIELDS et al. VIROLOGY, volume 2, chapters 69 & 70 (4th ed., Lippincott-Raven Publishers).

The “Rep coding sequences” need not encode all of the parvoviral or AAV Rep proteins. For example, with respect to AAV, the Rep coding sequences do not need to encode all four AAV Rep proteins (Rep78, Rep 68, Rep52 and Rep40), in fact, it is believed that AAV5 only expresses the spliced Rep68 and Rep40 proteins. In representative embodiments, the Rep coding sequences encode at least those replication proteins that are necessary for viral genome replication and packaging into new virions. The Rep coding sequences will generally encode at least one large Rep protein (i.e., Rep78/68) and one small Rep protein (i.e., Rep52/40). In particular embodiments, the Rep coding sequences encode the AAV Rep78 protein and the AAV Rep52 and/or Rep40 proteins. In other embodiments, the Rep coding sequences encode the Rep68 and the Rep52 and/or Rep40 proteins. In a still further embodiment, the Rep coding sequences encode the Rep68 and Rep52 proteins, Rep68 and Rep40 proteins, Rep78 and Rep52 proteins, or Rep78 and Rep40 proteins.

As used herein, the term “large Rep protein” refers to Rep68 and/or Rep78. Large Rep proteins of the claimed invention may be either wild-type or synthetic. A wild-type large Rep protein may be from any AAV, including but not limited to serotypes 1, 2, 3a, 3b, 4, 5, 6, 7, 8, 9, 10, 11, or 13, or any other AAV now known or later discovered (see, e.g., Table 1). A synthetic large Rep protein may be altered by insertion, deletion, truncation and/or missense mutations.

Those skilled in the art will further appreciate that it is not necessary that the replication proteins be encoded by the same polynucleotide. For example, for MVM, the NS-1 and NS-2 proteins (which are splice variants) may be expressed independently of one another. Likewise, for AAV, the p19 promoter may be inactivated and the large Rep protein(s) expressed from one polynucleotide and the small Rep protein(s) expressed from a different polynucleotide. Typically, however, it will be more convenient to express the replication proteins from a single construct. In some systems, the viral promoters (e.g., AAV p19 promoter) may not be recognized by the cell, and it is therefore necessary to express the large and small Rep proteins from separate expression cassettes. In other instances, it may be desirable to express the large Rep and small Rep proteins separately, i.e., under the control of separate transcriptional and/or translational control elements. For example, it may be desirable to control expression of the large Rep proteins, so as to decrease the ratio of large to small Rep proteins. In the case of insect cells, it may be advantageous to down-regulate expression of the large Rep proteins (e.g., Rep78/68) to avoid toxicity to the cells (see, e.g., Urabe et al., (2002) Human Gene Therapy 13:1935).

As used herein, the term “synthetic large Rep protein” refers to a large Rep protein having an amino acid sequence that differs from a wild-type large Rep protein sequence. The sequence of the synthetic large Rep protein may differ from a wild-type sequence due to one or more deletions, additions, substitutions, or any combination thereof. The difference between the synthetic and wild-type sequences may be as little as a single amino acid change, e.g., a change in 1, 2, 3, 4, 5,6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 60, 60, 70, 80, 90, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400. 425, 450, 475, 500, 525, 550, 575, or 600 or more amino acids or any range therein. In certain embodiments, the synthetic large Rep protein is a chimeric Rep protein comprising portions of the wild-type sequence of two or more different large Rep proteins. In other embodiments, the synthetic large Rep protein is a chimeric Rep protein comprising portions of the wild-type sequence of two or more different large Rep proteins, one or more portions of which have been modified from the wild-type sequence.

As used herein, the parvovirus or AAV “cap coding sequences” encode the structural proteins that form a functional parvovirus or AAV capsid (i.e., can package DNA and infect target cells). Typically, the cap coding sequences will encode all of the parvovirus or AAV capsid subunits, but less than all of the capsid subunits may be encoded as long as a functional capsid is produced. Typically, but not necessarily, the cap coding sequences will be present on a single nucleic acid molecule.

The capsid structure of autonomous parvoviruses and AAV are described in more detail in BERNARD N. FIELDS et al., VIROLOGY, volume 2, chapters 69 & 70 (4th ed., Lippincott-Raven Publishers).

As used herein, the term “structural element,” when used with respect to a parvovirus ITR, refers to a portion of the ITR that, based on nucleotide sequence, secondary structure, or both, plays a role in the functional interaction of a large Rep protein with the ITR, e.g., a portion that, when removed from the ITR, prevents functional interaction with a large Rep protein. In some embodiments, the structural element physically interacts with the large Rep protein.

As used herein, the term “functionally interacts” refers to an interaction between an ITR and a large Rep protein (e.g., binding) that ultimately results in nicking of the ITR and replication of a polynucleotide in which the ITR is present.

As used herein, the term “nicking stem” refers to a hairpin loop structure present in a parvovirus ITR that is nicked by a large Rep protein during replication of a polynucleotide in which the ITR is present.

As used herein, the term “extended RBE” refers to the nucleotide sequence of a parvovirus ITR between the nicking stem and the RBE (the spacer sequence as shown in FIG. 1A) which, in certain parvoviruses (e.g., AAV5), functions as an extension of the RBE (i.e., is recognized and bound by a large Rep protein). The term “extended RBE” is only applicable to the spacer sequence when the sequence functions as an extension of the RBE.

Modified Parvovirus ITRs

The present invention provides modified parvovirus ITRs and synthetic Rep proteins that functionally interact with the modified ITRs. The modified ITRs are unique in that they do not functionally interact with wild-type Rep proteins and may reduce or avoid vector mobilization.

One aspect of the invention relates to a polynucleotide comprising at least one parvovirus ITR, wherein the ITR comprises: (a) a first structural element that functionally interacts with a large Rep protein from a AAV but does not functionally interact with a large Rep protein from a second AAV; and (b) a second structural element that functionally interacts with the large Rep protein from the second AAV but does not functionally interact with the large Rep protein from the first AAV; wherein the ITR functionally interacts with a synthetic AAV large Rep protein. In one embodiment, the ITR does not functionally interact with any wild-type large Rep protein, e.g., AAV2 Rep, AAV5 Rep, or any other known Rep protein. In particular embodiments, the synthetic large Rep protein comprises an amino acid sequence selected from the group consisting of SEQ ID NOS:79, 81, and 83 or an amino acid sequence having at least 80% identity to one of SEQ ID NOS: 79, 81, and 83, e.g., at least 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity. In one embodiment, the ITR further comprises a third structural element that functionally interacts with a large Rep protein from an AAV that is the same as or different from the first and/or second AAV.

In one embodiment of the invention, the parvovirus ITR is from an autonomous parvovirus. In another embodiment, the parvovirus ITR is from an AAV, e.g., an AAV selected from the group consisting of AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, AAV12, and AAV13. In a further embodiment, the parvovirus ITR is from a non-human AAV such as snake AAV, avian AAV, bovine AAV, canine AAV, equine AAV, ovine AAV, goat AAV, or shrimp AAV.

The structural element of the ITR can be any structural element that is involved in the functional interaction of the ITR with a large Rep protein. In certain embodiments, the structural element provides selectivity to the interaction of an ITR with a large Rep protein, i.e., determines at least in part which Rep protein functionally interacts with the ITR. In other embodiments, the structural element physically interacts with a large Rep protein when the Rep protein is bound to the ITR. Each structural element can be, e.g., a secondary structure of the ITR, a nucleotide sequence of the ITR, a spacing between two or more elements, or a combination of any of the above. In one embodiment, the structural elements are selected from the group consisting of a nicking stem, a spacer, a RBE, an extended RBE, and any combination thereof. In a particular embodiment, the first structural element is a nicking stem. In another embodiment, the second structural element is a RBE. In a further embodiment, the second structural element is an extended RBE. In an additional embodiment, the second structural element is a spacer.

The ability of a structural element to functionally interact with a particular large Rep protein can be altered by modifying the structural element. For example, the nucleotide sequence of the structural element can be modified as compared to the wild-type sequence of the ITR. In one embodiment, the structural element (e.g., the nicking stem, spacer, RBE, and/or extended RBE) of an ITR can be removed and replaced with a wild-type structural element from a different parvovirus. For example, the replacement structure can be from AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, AAV12, AAV13, snake parvovirus (e.g., royal python parvovirus), bovine parvovirus, goat parvovirus, avian parvovirus, canine parvovirus, equine parvovirus, shrimp parvovirus, porcine parvovirus, or insect AAV. For example, the ITR can be an AAV2 ITR and the nicking stem or RBE can be replaced with a structural element from AAV5. In another example, the ITR can be an AAV5 ITR and the nicking stem, RBE, or extended RBE can be replaced with a structural element from AAV2. In one example, the ITR can be an AAV2 ITR with the nicking stem replaced with the AAV5 ITR nicking stem, e.g., the ITR of SEQ ID NO:22 or a modified sequence thereof. In another example, the AAV ITR can be an AAV5 ITR with the nicking stem replaced with the AAV2 ITR nicking stem, e.g., the ITR of SEQ ID NO:21 or a modified sequence thereof.

In one embodiment, the nucleotide sequence of the structural element can be modified (e.g., by modifying 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 or more nucleotides or any range therein) to produce a synthetic structural element. In certain embodiments, the specific ITRs exemplified herein (SEQ ID NOS:17-52) can be modified (e.g., by modifying 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 or more nucleotides or any range therein). In other embodiments, the ITR can have at least 80% identity with one of the ITRs of SEQ ID NOS:17-52, e.g., at least 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity. In one embodiment, the structural element is a nicking stem and the modified sequence is a modified terminal resolution site (trs) sequence. For example, a nicking stem can be modified to comprise the ITR2 trs (GGT/TGG) or the ITR5 trs (AGTG/TGG). In another embodiment, the structural element is a RBE or an extended RBE and the sequence is a modified at the nucleotides responsible for binding specificity. For example, the sequence of a RBE or an extended RBE can be modified to make the sequence closer to or further from the consensus GAGY binding sites recognized by Rep. In one example, the spacer or extended RBE can be modified to comprise one or more exact GAGY repeats (e.g., the ITR of SEQ ID NO:39 or a modified sequence thereof), e.g., 1, 2, 3, or 4 or more exact GAGY repeats.

In a different embodiment, the structure of the structural element can be modified. For example, the structural element can be a nicking stem and the modification can be a change in the height of the stem and/or the number of nucleotides in the loop. For example, the height of the stem can be about 2, 3, 4, 5, 6, 7, 8, or 9 nucleotides or more or any range therein. In one embodiment, the nicking stem height can be about 5 nucleotides to about 9 nucleotides and functionally interacts with Rep2. In another embodiment, the nicking stem height can be about 7 nucleotides and functionally interacts with Rep5. In another example, the loop can have 3, 4, 5, 6, 7, 8, 9, or 10 nucleotides or more or any range therein. In another example, the structural element can be a RBE or an extended RBE and the number of GAGY binding sites or GAGY-related binding sites within the RBE or extended RBE can be increased or decreased. In one example, the RBE or extended RBE, can comprise 1, 2, 3, 4, 5, or 6 or more GAGY binding sites or any range therein. Each GAGY binding site can independently be an exact GAGY sequence or a sequence similar to GAGY as long as the sequence is sufficient to bind a Rep protein.

In another embodiment, the spacing between two elements (such as the nicking stem and the RBE or the RBE and a hairpin) can be altered (e.g., increased or decreased) to alter functional interaction with a large Rep protein. For example, the spacing can be about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, or 21 nucleotides or more or any range therein. In one embodiment, the spacer between the nicking stem and the RBE is about 3 nucleotides in length and functionally interacts with Rep2. In another embodiment, the spacer between the nicking stem and the RBE is about 3 nucleotides (e.g., the ITR of SEQ ID NO:34 or a modified sequence thereof) to about 21 nucleotides in length (e.g., the ITR of SEQ ID NO:37 or a modified sequence thereof) and functionally interacts with Rep5. In one embodiment, the spacer is the 15 nucleotide spacer of the AAV5 ITR or a sequence having at least 80% identity thereto, e.g., at least 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity.

In a representative embodiment, the polynucleotide comprises at least one parvovirus ITR, wherein said ITR comprises: (a) a first structural element that functionally interacts with a large Rep protein from one or more of AAV1, AAV2, AAV3, AAV4, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, AAV12, and AAV13 but does not functionally interact with a large Rep protein from AAV5; and (b) a second structural element that functionally interacts with the large Rep protein from AAV5 but does not functionally interact with the large Rep protein from one or more of AAV1, AAV2, AAV3, AAV4, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, AAV12, and AAV13; wherein the ITR functionally interacts with a synthetic AAV large Rep protein comprising an amino acid sequence selected from SEQ ID NOS: 79, 81, and 83.

In one aspect of the invention the polynucleotide comprising the modified ITR of the invention further comprises a second ITR which may be the same as or different from the first ITR. In one embodiment, the polynucleotide further comprises a heterologous nucleic acid, e.g., a sequence encoding a protein or a functional RNA. In some embodiments, the second ITR cannot be resolved by the Rep protein, i.e., resulting in a double stranded viral DNA.

The invention also provides a viral vector comprising the polynucleotide comprising the modified ITR of the invention. The viral vector can be a parvovirus vector, e.g., an AAV vector. The invention further provides a recombinant parvovirus particle (e.g., a recombinant AAV particle) comprising the modified ITR of the invention. Viral vectors and viral particles are discussed further below.

Synthetic Rep Proteins

One aspect of the invention relates to synthetic large Rep proteins that functionally interact with the modified ITRs of the invention. Thus, in one aspect, the invention relates to a synthetic large Rep protein comprising a first portion that functionally interacts with a first structural element of a parvovirus ITR and a second portion that functionally interacts with a second structural element of a parvovirus ITR, wherein said first structural element functionally interacts with a large Rep protein from a first AAV but does not functionally interact with a large Rep protein from a second AAV and said second structural element functionally interacts with a large Rep protein from a second AAV but does not functionally interact with a large Rep protein from the first AAV. In one embodiment, the protein comprises a third portion that functionally interacts with a third structural element that functionally interacts with a large Rep protein from an AAV that is the same as or different from the first and/or second AAV. In one embodiment, the first structural element is a nicking stem and the first portion of the synthetic large Rep protein functional interacts with the nicking stem. In another embodiment, the second structural element is a spacer, RBE, or extended RBE and the second portion of the synthetic large Rep protein functional interacts with the spacer, RBE, or extended RBE.

In one embodiment, one or more portions of the synthetic large Rep protein comprise a wild-type amino acid sequence from a parvovirus Rep protein. In another embodiment, one or more portions of the synthetic large Rep protein comprise an amino acid sequence that is modified as compared to the wild-type sequence of a parvovirus Rep protein. The modification can be an addition, deletion, substitution, or any combination thereof. The synthetic large Rep protein can comprise one or more modified amino acids, e.g., about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 105, 110, 115, 120, 125, 130, 135, 140, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 525, 550, 575, or 600 or more amino acids or any range therein.

In one embodiment of the invention, the first and second portions (and/or the third portion) are directly linked to each other. In another embodiment, the portions are connected by a linker, e.g., 1, 2, 3, 4, 5, or 6 or more amino acids. The synthetic large Rep protein can comprise further portions (e.g., from Rep or another protein or synthetic sequences) that are not involved in the functional interaction with an ITR. Examples of other sequences can include, without limitation, localization signals, tags for improved isolation, etc.

In one embodiment, the first portion of the synthetic large Rep protein comprises, consists essentially of, or consists of an amino acid sequence from about residue 97 to about residues 146-151 of a wild-type AAV5 Rep sequence, e.g., SEQ ID NO:118. For example the first portion can comprise, consist essentially of, or consist of an amino acid sequence from about residue 1, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, or 95 to about residue 146, 147, 148, 149, 151, or 151 of a wild-type AAV5 Rep sequence or any range therein. In certain embodiments, the first portion comprises, consists essentially of, or consists of an amino acid sequence having at least 80% identity to a sequence from about residue 97 to about residues 146-151 of a wild-type AAV5 Rep sequence, e.g., at least 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity.

In one embodiment, the second portion of the synthetic large Rep protein comprises, consists essentially of, or consists of an amino acid sequence from about residue 149 to about residue 187 of a wild-type AAV2 Rep sequence, e.g., SEQ ID NO:114. For example, the second portion can comprise, consist essentially of, or consist of an amino acid sequence from about residue 149 to about residue 190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300, 310, 320, 330, 340, 350, 360, 370, 380, 390, 400, 410, 420, 430, 440, 450, 460, 470, 480, 490, 500, 510, 520, 530, 540, 550, 560, 570, 580, 590, 600, 610, or 620 of a wild-type AAV2 Rep sequence or any range therein. In certain embodiments, the second portion comprises, consists essentially of, or consists of an amino acid sequence having at least 80% identity to a sequence from about residue 149 to about residue 187 of a wild-type AAV5 Rep sequence, e.g., at least 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity.

In a representative embodiment of the synthetic large Rep protein, the first portion comprises, consists essentially of, or consists of an amino acid sequence from about residue 97 to about residues 146-151 of a wild-type AAV5 Rep sequence and the second portion comprises, consists essentially of, or consists of an amino acid sequence from about residue 149 to about residue 187 of a wild-type AAV2 Rep sequence. In another representative embodiment, the first portion comprises, consists essentially of, or consists of an amino acid sequence from about residue 1 to about residues 146-151 of a wild-type AAV5 Rep sequence and the second portion comprises, consists essentially of, or consists of an amino acid sequence from about residue 149 to about residue 621 of a wild-type AAV2 Rep sequence. In certain embodiments, the synthetic large Rep protein comprises, consists essentially of, or consists of an amino acid sequence of SEQ ID NOS: 79, 81, and 83. In other embodiments, the synthetic large Rep protein comprises, consists essentially of, or consists of an amino acid sequence having at least 80% identity to an amino acid sequence of SEQ ID NOS: 79, 81, and 83, e.g., at least 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity.

In certain embodiments, the portion of the synthetic large Rep protein from a wild-type AAV2 Rep sequence as described above can be replaced with the corresponding portion from another human AAV serotype Rep protein other than AAV5, e.g., AAV1, AAV3, AAV4, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, AAV12, or AAV13. The structural and functional similarity between the Rep proteins of AAV2 and other human serotypes (with the exception of AAV5) may allow substitution of Rep sequences between the serotypes (see FIGS. 31 and 32).

In certain embodiments, one or more of the portions the synthetic Rep proteins can be modified to differ from the wild-type sequence (e.g., by modifying 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 or more aa or any range therein). In other embodiments, the synthetic Rep proteins exemplified herein can be modified (e.g., by modifying 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 or more aa or any range therein). In some embodiments, the modified synthetic Rep proteins retain amino acid Y156 (numbering according to Rep2). In other embodiments, modified synthetic Rep proteins retain amino acids C151, N155, and/or T161 (numbering according to Rep2). In other embodiments, modified synthetic Rep proteins retain amino acids G148, A152, and/or V158 (numbering according to Rep5). These specific amino acids may be important for activity and/or specificity.

The invention also provides polynucleotides (optionally, isolated polynucleotides) encoding the synthetic Rep proteins of the invention. In some embodiments, the polynucleotides further encode one or more parvovirus Cap proteins. Further provided are vectors comprising the polynucleotides, and cells (in vivo or in culture) comprising the polynucleotides and/or vectors of the invention. Suitable vectors include, without limitation, viral vectors (e.g., adenovirus, AAV, herpesvirus, vaccinia, poxviruses, baculoviruses, Epstein-Barr virus, and the like), plasmids, phage, YACs, BACs, and the like. In some embodiments, the polynucleotide is stably integrated into the genome of a cell. Such polynucleotides, vectors and cells can be used, for example, as reagents (e.g., helper packaging constructs or packaging cells) for the production of virus vectors as described herein.

Snake AAV ITRs

One aspect of the invention relates to the discovery that a snake AAV ITR sequence can function as a part of a parvovirus vector yet is not recognized by the Rep proteins of mammalian (e.g., human or primate) parvoviruses. Vector mobilization may therefore be reduced or avoided. Thus, one aspect of the invention relates to a parvovirus template comprising (i) a heterologous nucleotide sequence, and (ii) at least one snake AAV ITR sequence. The snake AAV ITR sequence can be from a royal python AAV. In one embodiment, the snake AAV ITR sequence comprises the nucleotide sequence of SEQ ID NO:123. In a further embodiment, the snake AAV ITR sequence comprises the nucleotide sequence of SEQ ID NO:123 that has been modified (e.g., by modifying 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 or more nucleotides or any range therein). In other embodiments, the parvovirus template comprises at least a portion of a snake AAV ITR, e.g., at least 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 105, 110, 115, 120, 125, 130, 135, 140, 145, or 150 or more contiguous nucleotides of a snake AAV ITR or any range therein. In certain embodiments, the parvovirus template comprises two snake AAV ITR sequences.

The invention further relates to a parvovirus particle comprising the snake parvovirus template of the invention. In certain embodiments, the parvovirus particle comprises a mammalian capsid, e.g., a human or primate capsid.

In one aspect, the invention relates to the discovery of methods for producing parvovirus particles comprising a snake AAV ITR, including the requirement for a mammalian small Rep protein. Thus, one aspect of the invention relates to a method of producing a parvovirus particle, comprising providing to a cell (e.g., a mammalian cell such as a human or primate cell) permissive for parvovirus replication: (a) a recombinant parvovirus template comprising (i) a heterologous nucleotide sequence, and (ii) at least one snake AAV ITR sequence; (b) a polynucleotide encoding one or more snake AAV Rep proteins and mammalian AAV Cap protein(s); and (c) a polynucleotide encoding mammalian Rep52 and/or Rep40 proteins; under conditions sufficient for the replication and packaging of the recombinant parvovirus template; whereby recombinant parvovirus particles packaging the recombinant parvovirus template are produced in the cell. In one embodiment, the mammalian AAV Cap protein is a human or primate AAV Cap protein. In another embodiment, the mammalian AAV Rep 52 and/or Rep 40 proteins are human or primate Rep52 and/or Rep40 proteins (including modified forms thereof), e.g., from AAV2. In some embodiments, the polynucleotide encoding snake AAV Rep protein and mammalian AAV Cap protein also encodes the mammalian Rep52 and/or Rep40 proteins. In other embodiments, the polynucleotide encoding snake AAV Rep protein and mammalian AAV Cap protein is separate from the polynucleotide encoding the mammalian Rep52 and/or Rep40 proteins. In some embodiments, the polynucleotide encoding snake AAV Rep protein and mammalian AAV Cap protein is the plasmid pSnRepCap2 (SEQ ID NO:125).

In other embodiments, other non-human AAV ITR sequences not recognized by the Rep proteins of human or primate parvoviruses may be used. Examples include, without limitation, sequences from shrimp, insect, goat, bovine, equine, canine, and equine AAVs.

Methods of Producing Virus Vectors

The present invention further provides methods of producing virus vectors. In one particular embodiment, the present invention provides a method of producing a recombinant parvovirus particle, comprising providing to a cell permissive for parvovirus replication: (a) a recombinant parvovirus template comprising (i) a heterologous nucleotide sequence, and (ii) the modified parvovirus ITR of the invention; (b) a polynucleotide encoding a synthetic large Rep protein of the invention; under conditions sufficient for the replication and packaging of the recombinant parvovirus template; whereby recombinant parvovirus particles are produced in the cell. Conditions sufficient for the replication and packaging of the recombinant parvovirus template can be, e.g., the presence of AAV sequences sufficient for replication of the parvovirus template and encapsidation into parvovirus capsids (e.g., parvovirus rep sequences and parvovirus cap sequences) and helper sequences from adenovirus and/or herpesvirus. In particular embodiments, the parvovirus template comprises two parvovirus ITR sequences, which are located 5′ and 3′ to the heterologous nucleic acid sequence, although they need not be directly contiguous thereto.

In some embodiments, the recombinant parvovirus template comprises an ITR that is not resolved by Rep to make duplexed AAV vectors as described in international patent publication WO 01/92551

The parvovirus template and parvovirus rep and cap sequences are provided under conditions such that virus vector comprising the parvovirus template packaged within the parvovirus capsid is produced in the cell. The method can further comprise the step of collecting the virus vector from the cell. The virus vector can be collected from the medium and/or by lysing the cells.

The cell can be a cell that is permissive for parvovirus viral replication. Any suitable cell known in the art may be employed. In particular embodiments, the cell is a mammalian cell (e.g., a primate or human cell). As another option, the cell can be a trans-complementing packaging cell line that provide functions deleted from a replication-defective helper virus, e.g., 293 cells or other E1a trans-complementing cells.

The parvovirus replication and capsid sequences may be provided by any method known in the art. Current protocols typically express the parvovirus rep/cap genes on a single plasmid. The parvovirus replication and packaging sequences need not be provided together, although it may be convenient to do so. The parvovirus rep and/or cap sequences may be provided by any viral or non-viral vector. For example, the rep/cap sequences may be provided by a hybrid adenovirus or herpesvirus vector (e.g., inserted into the E1a or E3 regions of a deleted adenovirus vector). EBV vectors may also be employed to express the parvovirus cap and rep genes. One advantage of this method is that EBV vectors are episomal, yet will maintain a high copy number throughout successive cell divisions (i.e., are stably integrated into the cell as extra-chromosomal elements, designated as an “EBV based nuclear episome,” see Margolski, (1992) Curr. Top. Microbiol. Immun. 158:67).

As a further alternative, the rep/cap sequences may be stably incorporated into a cell.

Typically the parvovirus rep/cap sequences will not be flanked by the TRs, to prevent rescue and/or packaging of these sequences.

The parvovirus template can be provided to the cell using any method known in the art. For example, the template can be supplied by a non-viral (e.g., plasmid) or viral vector. In particular embodiments, the parvovirus template is supplied by a herpesvirus or adenovirus vector (e.g., inserted into the E1a or E3 regions of a deleted adenovirus). As another illustration, Palombo et al., (1998) J. Virology 72:5025, describes a baculovirus vector carrying a reporter gene flanked by the AAV TRs. EBV vectors may also be employed to deliver the template, as described above with respect to the rep/cap genes.

In another representative embodiment, the parvovirus template is provided by a replicating rAAV virus. In still other embodiments, an AAV provirus comprising the parvovirus template is stably integrated into the chromosome of the cell.

To enhance virus titers, helper virus functions (e.g., adenovirus or herpesvirus) that promote a productive parvovirus infection can be provided to the cell. Helper virus sequences necessary for parvovirus replication are known in the art. Typically, these sequences will be provided by a helper adenovirus or herpesvirus vector. Alternatively, the adenovirus or herpesvirus sequences can be provided by another non-viral or viral vector, e.g., as a non-infectious adenovirus miniplasmid that carries all of the helper genes that promote efficient parvovirus production as described by Ferrari et al., (1997) Nature Med. 3:1295, and U.S. Pat. Nos. 6,040,183 and 6,093,570.

Further, the helper virus functions may be provided by a packaging cell with the helper sequences embedded in the chromosome or maintained as a stable extrachromosomal element. Generally, the helper virus sequences cannot be packaged into parvovirus virions, e.g., are not flanked by TRs.

Those skilled in the art will appreciate that it may be advantageous to provide the parvovirus replication and capsid sequences and the helper virus sequences (e.g., adenovirus sequences) on a single helper construct. This helper construct may be a non-viral or viral construct. As one nonlimiting illustration, the helper construct can be a hybrid adenovirus or hybrid herpesvirus comprising the parvovirus rep/cap genes.

In one particular embodiment, the parvovirus rep/cap sequences and the adenovirus helper sequences are supplied by a single adenovirus helper vector. This vector can further comprise the parvovirus template. The parvovirus rep/cap sequences and/or the parvovirus template can be inserted into a deleted region (e.g., the E1a or E3 regions) of the adenovirus.

In a further embodiment, the parvovirus rep/cap sequences and the adenovirus helper sequences are supplied by a single adenovirus helper vector. According to this embodiment, the parvovirus template can be provided as a plasmid template.

In another illustrative embodiment, the parvovirus rep/cap sequences and adenovirus helper sequences are provided by a single adenovirus helper vector, and the parvovirus template is integrated into the cell as a provirus. Alternatively, the parvovirus template is provided by an EBV vector that is maintained within the cell as an extrachromosomal element (e.g., as an EBV based nuclear episome).

In a further exemplary embodiment, the parvovirus rep/cap sequences and adenovirus helper sequences are provided by a single adenovirus helper. The parvovirus template can be provided as a separate replicating viral vector. For example, the parvovirus template can be provided by a parvovirus particle or a second recombinant adenovirus particle.

According to the foregoing methods, the hybrid adenovirus vector typically comprises the adenovirus 5′ and 3′ cis sequences sufficient for adenovirus replication and packaging (i.e., the adenovirus terminal repeats and PAC sequence). The parvovirus rep/cap sequences and, if present, the parvovirus template are embedded in the adenovirus backbone and are flanked by the 5′ and 3′ cis sequences, so that these sequences may be packaged into adenovirus capsids. As described above, the adenovirus helper sequences and the parvovirus rep/cap sequences are generally not flanked by TRs so that these sequences are not packaged into the parvovirus virions.

Zhang et al., ((2001) Gene Ther. 18:704-12) describe a chimeric helper comprising both adenovirus and the AAV rep and cap genes.

Herpesvirus may also be used as a helper virus in parvovirus packaging methods. Hybrid herpesviruses encoding the parvovirus Rep protein(s) may advantageously facilitate scalable parvovirus vector production schemes. A hybrid herpes simplex virus type I (HSV-1) vector expressing the AAV-2 rep and cap genes has been described (Conway et al., (1999) Gene Ther. 6:986 and WO 00/17377.

As a further alternative, the virus vectors of the invention can be produced in insect cells using baculovirus vectors to deliver the rep/cap genes and parvovirus template as described, for example, by Urabe et al., (2002) Human Gene Ther. 13:1935-43.

Parvovirus vector stocks free of contaminating helper virus may be obtained by any method known in the art. For example, parvovirus and helper virus may be readily differentiated based on size. Parvovirus may also be separated away from helper virus based on affinity for a heparin substrate (Zolotukhin et al. (1999) Gene Therapy 6:973). Deleted replication-defective helper viruses can be used so that any contaminating helper virus is not replication competent. As a further alternative, an adenovirus helper lacking late gene expression may be employed, as only adenovirus early gene expression is required to mediate packaging of parvovirus. Adenovirus mutants defective for late gene expression are known in the art (e.g., ts100K and ts149 adenovirus mutants).

Recombinant Virus Vectors

The virus vectors of the present invention are useful for the delivery of nucleic acids to cells in vitro, ex vivo, and in vivo. In particular, the virus vectors can be advantageously employed to deliver or transfer nucleic acids to animal, including mammalian, cells.

Any heterologous nucleic acid sequence(s) of interest may be delivered in the virus vectors of the present invention. Nucleic acids of interest include nucleic acids encoding polypeptides, including therapeutic (e.g., for medical or veterinary uses), immunogenic (e.g., for vaccines), or diagnostic polypeptides.

Therapeutic polypeptides include, but are not limited to, cystic fibrosis transmembrane regulator protein (CFTR), dystrophin (including mini- and micro-dystrophins (see, e.g, Vincent et al., (1993) Nature Genetics 5:130; U.S. Patent Publication No. 2003/017131; International publication WO/2008/088895, Wang et al., Proc. Natl. Acad. Sci. USA 97:13714-13719 (2000); and Gregorevic et al., Mol. Ther. 16:657-64 (2008)), myostatin propeptide, follistatin, activin type II soluble receptor, IGF-1, anti-inflammatory polypeptides such as the Ikappa B dominant mutant, sarcospan, utrophin (Tinsley et al., (1996) Nature 384:349), mini-utrophin, clotting factors (e.g., Factor VIII, Factor IX, Factor X, etc.), erythropoietin, angiostatin, endostatin, catalase, tyrosine hydroxylase, superoxide dismutase, leptin, the LDL receptor, lipoprotein lipase, ornithine transcarbamylase, β-globin, α-globin, spectrin, α₁-antitrypsin, adenosine deaminase, hypoxanthine guanine phosphoribosyl transferase, β-glucocerebrosidase, sphingomyelinase, lysosomal hexosaminidase A, branched-chain keto acid dehydrogenase, RP65 protein, cytokines (e.g., α-interferon, β-interferon, interferon-γ, interleukin-2, interleukin-4, granulocyte-macrophage colony stimulating factor, lymphotoxin, and the like), peptide growth factors, neurotrophic factors and hormones (e.g., somatotropin, insulin, insulin-like growth factors 1 and 2, platelet derived growth factor, epidermal growth factor, fibroblast growth factor, nerve growth factor, neurotrophic factor-3 and -4, brain-derived neurotrophic factor, bone morphogenic proteins [including RANKL and VEGF], glial derived growth factor, transforming growth factor-α and -β, and the like), lysosomal acid α-glucosidase, α-galactosidase A, receptors (e.g., the tumor necrosis growth factorα soluble receptor), S100A1, parvalbumin, adenylyl cyclase type 6, a molecule that effects G-protein coupled receptor kinase type 2 knockdown such as a truncated constitutively active bARKct, anti-inflammatory factors such as IRAP, anti-myostatin proteins, aspartoacylase, and monoclonal antibodies (including single chain monoclonal antibodies; an exemplary Mab is the Herceptin® Mab). Other illustrative heterologous nucleic acid sequences encode suicide gene products (e.g., thymidine kinase, cytosine deaminase, diphtheria toxin, and tumor necrosis factor), proteins conferring resistance to a drug used in cancer therapy, tumor suppressor gene products (e.g., p53, Rb, Wt-1), TRAIL, FAS-ligand, and any other polypeptide that has a therapeutic effect in a subject in need thereof. Parvovirus vectors can also be used to deliver monoclonal antibodies and antibody fragments, for example, an antibody or antibody fragment directed against myostatin (see, e.g., Fang et al., Nature Biotechnol. 23:584-590 (2005)).

Heterologous nucleic acid sequences encoding polypeptides include those encoding reporter polypeptides (e.g., an enzyme). Reporter polypeptides are known in the art and include, but are not limited to, Green Fluorescent Protein, β-galactosidase, alkaline phosphatase, luciferase, and chloramphenicol acetyltransferase gene.

Alternatively, in particular embodiments of this invention, the heterologous nucleic acid may encode an antisense nucleic acid, a ribozyme (e.g., as described in U.S. Pat. No. 5,877,022), RNAs that effect spliceosome-mediated trans-splicing (see, Puttaraju et al., (1999) Nature Biotech. 17:246; U.S. Pat. No. 6,013,487; U.S. Pat. No. 6,083,702), interfering RNAs (RNAi) including siRNA, shRNA or miRNA that mediate gene silencing (see, Sharp et al., (2000) Science 287:2431), and other non-translated RNAs, such as “guide” RNAs (Gorman et al., (1998) Proc. Nat. Acad. Sci. USA 95:4929; U.S. Pat. No. 5,869,248 to Yuan et al.), and the like. Exemplary untranslated RNAs include RNAi against a multiple drug resistance (MDR) gene product (e.g., to treat and/or prevent tumors and/or for administration to the heart to prevent damage by chemotherapy), RNAi against myostatin (e.g., for Duchenne muscular dystrophy), RNAi against VEGF (e.g., to treat and/or prevent tumors), RNAi against phospholamban (e.g., to treat cardiovascular disease, see, e.g., Andino et al., J. Gene Med. 10:132-142 (2008) and Li et al., Acta Pharmacol Sin. 26:51-55 (2005)); phospholamban inhibitory or dominant-negative molecules such as phospholamban S16E (e.g., to treat cardiovascular disease, see, e.g., Hoshijima et al. Nat. Med. 8:864-871 (2002)), RNAi to adenosine kinase (e.g., for epilepsy), RNAi to a sarcoglycan [e.g., α, β, γ], RNAi against myostatin, myostatin propeptide, follistatin, or activin type II soluble receptor, RNAi against anti-inflammatory polypeptides such as the Ikappa B dominant mutant, and RNAi directed against pathogenic organisms and viruses (e.g., hepatitis B virus, human immunodeficiency virus, CMV, herpes simplex virus, human papilloma virus, etc.).

Alternatively, in particular embodiments of this invention, the heterologous nucleic acid may encode protein phosphatase inhibitor I (I-1), serca2a, zinc finger proteins that regulate the phospholamban gene, Barkct, β2-adrenergic receptor, β2-adrenergic receptor kinase (BARK), phosphoinositide-3 kinase (PI3 kinase), a molecule that effects G-protein coupled receptor kinase type 2 knockdown such as a truncated constitutively active bARKct; calsarcin, RNAi against phospholamban; phospholamban inhibitory or dominant-negative molecules such as phospholamban S16E, enos, inos, or bone morphogenic proteins (including BNP 2, 7, etc., RANKL and/or VEGF).

The virus vector may also comprise a heterologous nucleic acid that shares homology with and recombines with a locus on a host chromosome. This approach can be utilized, for example, to correct a genetic defect in the host cell.

The present invention also provides virus vectors that express an immunogenic polypeptide, e.g., for vaccination. The nucleic acid may encode any immunogen of interest known in the art including, but not limited to, immunogens from human immunodeficiency virus (HIV), simian immunodeficiency virus (SIV), influenza virus, HIV or SIV gag proteins, tumor antigens, cancer antigens, bacterial antigens, viral antigens, and the like.

The use of parvoviruses as vaccine vectors is known in the art (see, e.g., Miyamura et al., (1994) Proc. Nat. Acad. Sci USA 91:8507; U.S. Pat. No. 5,916,563 to Young et al., U.S. Pat. No. 5,905,040 to Mazzara et al., U.S. Pat. No. 5,882,652, U.S. Pat. No. 5,863,541 to Samulski et al.). The antigen may be presented in the parvovirus capsid. Alternatively, the antigen may be expressed from a heterologous nucleic acid introduced into a recombinant vector genome. Any immunogen of interest as described herein and/or as is known in the art can be provided by the virus vector of the present invention.

An immunogenic polypeptide can be any polypeptide suitable for eliciting an immune response and/or protecting the subject against an infection and/or disease, including, but not limited to, microbial, bacterial, protozoal, parasitic, fungal and/or viral infections and diseases. For example, the immunogenic polypeptide can be an orthomyxovirus immunogen (e.g., an influenza virus immunogen, such as the influenza virus hemagglutinin (HA) surface protein or the influenza virus nucleoprotein, or an equine influenza virus immunogen) or a lentivirus immunogen (e.g., an equine infectious anemia virus immunogen, a Simian Immunodeficiency Virus (SIV) immunogen, or a Human Immunodeficiency Virus (HIV) immunogen, such as the HIV or SIV envelope GP160 protein, the HIV or SIV matrix/capsid proteins, and the HIV or SIV gag, pol and env genes products). The immunogenic polypeptide can also be an arenavirus immunogen (e.g., Lassa fever virus immunogen, such as the Lassa fever virus nucleocapsid protein and the Lassa fever envelope glycoprotein), a poxvirus immunogen (e.g., a vaccinia virus immunogen, such as the vaccinia L1 or L8 gene products), a flavivirus immunogen (e.g., a yellow fever virus immunogen or a Japanese encephalitis virus immunogen), a filovirus immunogen (e.g., an Ebola virus immunogen, or a Marburg virus immunogen, such as NP and GP gene products), a bunyavirus immunogen (e.g., RVFV, CCHF, and/or SFS virus immunogens), or a coronavirus immunogen (e.g., an infectious human coronavirus immunogen, such as the human coronavirus envelope glycoprotein, or a porcine transmissible gastroenteritis virus immunogen, or an avian infectious bronchitis virus immunogen). The immunogenic polypeptide can further be a polio immunogen, a herpes immunogen (e.g., CMV, EBV, HSV immunogens) a mumps immunogen, a measles immunogen, a rubella immunogen, a diphtheria toxin or other diphtheria immunogen, a pertussis antigen, a hepatitis (e.g., hepatitis A, hepatitis B, hepatitis C, etc.) immunogen, and/or any other vaccine immunogen now known in the art or later identified as an immunogen.

Alternatively, the immunogenic polypeptide can be any tumor or cancer cell antigen. Optionally, the tumor or cancer antigen is expressed on the surface of the cancer cell. Exemplary cancer and tumor cell antigens are described in S. A. Rosenberg (Immunity 10:281 (1991)). Other illustrative cancer and tumor antigens include, but are not limited to: BRCA1 gene product, BRCA2 gene product, gp100, tyrosinase, GAGE-1/2, BAGE, RAGE, LAGE, NY-ESO-1, CDK-4, β-catenin, MUM-1, Caspase-8, KIAA0205, HPVE, SART-1, PRAME, p15, melanoma tumor antigens (Kawakami et al., (1994) Proc. Natl. Acad. Sci. USA 91:3515; Kawakami et al., (1994) J. Exp. Med., 180:347; Kawakami et al., (1994) Cancer Res. 54:3124), MART-1, gp100 MAGE-1, MAGE-2, MAGE-3, CEA, TRP-1, TRP-2, P-15, tyrosinase (Brichard et al., (1993) J. Exp. Med. 178:489); HER-2/neu gene product (U.S. Pat. No. 4,968,603), CA 125, LK26, FB5 (endosialin), TAG 72, AFP, CA19-9, NSE, DU-PAN-2, CA50, SPan-1, CA72-4, HCG, STN (sialyl Tn antigen), c-erbB-2 proteins, PSA, L-CanAg, estrogen receptor, milk fat globulin, p53 tumor suppressor protein (Levine, (1993) Ann. Rev. Biochem. 62:623); mucin antigens (International Patent Publication No. WO 90/05142); telomerases; nuclear matrix proteins; prostatic acid phosphatase; papilloma virus antigens; and/or antigens now known or later discovered to be associated with the following cancers: melanoma, adenocarcinoma, thymoma, lymphoma (e.g., non-Hodgkin's lymphoma, Hodgkin's lymphoma), sarcoma, lung cancer, liver cancer, colon cancer, leukemia, uterine cancer, breast cancer, prostate cancer, ovarian cancer, cervical cancer, bladder cancer, kidney cancer, pancreatic cancer, brain cancer and any other cancer or malignant condition now known or later identified (see, e.g., Rosenberg, (1996) Ann. Rev. Med. 47:481-91).

As a further alternative, the heterologous nucleic acid can encode any polypeptide that is desirably produced in a cell in vitro, ex vivo, or in vivo. For example, the virus vectors may be introduced into cultured cells and the expressed gene product isolated therefrom.

It will be understood by those skilled in the art that the heterologous nucleic acid(s) of interest can be operably associated with appropriate control sequences. For example, the heterologous nucleic acid can be operably associated with expression control elements, such as transcription/translation control signals, origins of replication, polyadenylation signals, internal ribosome entry sites (IRES), promoters, and/or enhancers, and the like.

Those skilled in the art will appreciate that a variety of promoter/enhancer elements can be used depending on the level and tissue-specific expression desired. The promoter/enhancer can be constitutive or inducible, depending on the pattern of expression desired. The promoter/enhancer can be native or foreign and can be a natural or a synthetic sequence. By foreign, it is intended that the transcriptional initiation region is not found in the wild-type host into which the transcriptional initiation region is introduced.

In particular embodiments, the promoter/enhancer elements can be native to the target cell or subject to be treated. In representative embodiments, the promoters/enhancer element can be native to the heterologous nucleic acid sequence. The promoter/enhancer element is generally chosen so that it functions in the target cell(s) of interest. Further, in particular embodiments the promoter/enhancer element is a mammalian promoter/enhancer element. The promoter/enhancer element may be constitutive or inducible.

Inducible expression control elements are typically advantageous in those applications in which it is desirable to provide regulation over expression of the heterologous nucleic acid sequence(s). Inducible promoters/enhancer elements for gene delivery can be tissue-specific or -preferred promoter/enhancer elements, and include muscle specific or preferred (including cardiac, skeletal and/or smooth muscle specific or preferred), neural tissue specific or preferred (including brain-specific or preferred), eye specific or preferred (including retina-specific and cornea-specific), liver specific or preferred, bone marrow specific or preferred, pancreatic specific or preferred, spleen specific or preferred, and lung specific or preferred promoter/enhancer elements. Other inducible promoter/enhancer elements include hormone-inducible and metal-inducible elements. Exemplary inducible promoters/enhancer elements include, but are not limited to, a Tet on/off element, a RU486-inducible promoter, an ecdysone-inducible promoter, a rapamycin-inducible promoter, and a metallothionein promoter.

In embodiments wherein the heterologous nucleic acid sequence(s) is transcribed and then translated in the target cells, specific initiation signals are generally included for efficient translation of inserted protein coding sequences. These exogenous translational control sequences, which may include the ATG initiation codon and adjacent sequences, can be of a variety of origins, both natural and synthetic.

The virus vectors according to the present invention provide a means for delivering heterologous nucleic acids into a broad range of cells, including dividing and non-dividing cells. The virus vectors can be employed to deliver a nucleic acid of interest to a cell in vitro, e.g., to produce a polypeptide in vitro or for ex vivo gene therapy. The virus vectors are additionally useful in a method of delivering a nucleic acid to a subject in need thereof, e.g., to express an immunogenic or therapeutic polypeptide or a functional RNA. In this manner, the polypeptide or functional RNA can be produced in vivo in the subject. The subject can be in need of the polypeptide because the subject has a deficiency of the polypeptide. Further, the method can be practiced because the production of the polypeptide or functional RNA in the subject may impart some beneficial effect.

The virus vectors can also be used to produce a polypeptide of interest or functional RNA in cultured cells or in a subject (e.g., using the subject as a bioreactor to produce the polypeptide or to observe the effects of the functional RNA on the subject, for example, in connection with screening methods).

In general, the virus vectors of the present invention can be employed to deliver a heterologous nucleic acid encoding a polypeptide or functional RNA to treat and/or prevent any disease state for which it is beneficial to deliver a therapeutic polypeptide or functional RNA. Illustrative disease states include, but are not limited to: cystic fibrosis (cystic fibrosis transmembrane regulator protein) and other diseases of the lung, hemophilia A (Factor VIII), hemophilia B (Factor IX), thalassemia (β-globin), anemia (erythropoietin) and other blood disorders, Alzheimer's disease (GDF; neprilysin), multiple sclerosis (β-interferon), Parkinson's disease (glial-cell line derived neurotrophic factor [GDNF]), Huntington's disease (RNAi to remove repeats), amyotrophic lateral sclerosis, epilepsy (galanin, neurotrophic factors), and other neurological disorders, cancer (endostatin, angiostatin, TRAIL, FAS-ligand, cytokines including interferons; RNAi including RNAi against VEGF or the multiple drug resistance gene product), diabetes mellitus (insulin), muscular dystrophies including Duchenne (dystrophin, mini-dystrophin, insulin-like growth factor I, a sarcoglycan [e.g., α, β, γ], RNAi against myostatin, myostatin propeptide, follistatin, activin type II soluble receptor, anti-inflammatory polypeptides such as the Ikappa B dominant mutant, sarcospan, utrophin, mini-utrophin, RNAi against splice junctions in the dystrophin gene to induce exon skipping [see, e.g., WO/2003/095647], antisense against U7 snRNAs to induce exon skipping [see, e.g., WO/2006/021724], and antibodies or antibody fragments against myostatin or myostatin propeptide) and Becker, Gaucher disease (glucocerebrosidase), Hurler's disease (α-L-iduronidase), adenosine deaminase deficiency (adenosine deaminase), glycogen storage diseases (e.g., Fabry disease [α-galactosidase] and Pompe disease [lysosomal acid α-glucosidase]) and other metabolic defects, congenital emphysema (α1-antitrypsin), Lesch-Nyhan Syndrome (hypoxanthine guanine phosphoribosyl transferase), Niemann-Pick disease (sphingomyelinase), Tays Sachs disease (lysosomal hexosaminidase A), Maple Syrup Urine Disease (branched-chain keto acid dehydrogenase), retinal degenerative diseases (and other diseases of the eye and retina; e.g., PDGF for macular degeneration), diseases of solid organs such as brain (including Parkinson's Disease [GDNF], astrocytomas [endostatin, angiostatin and/or RNAi against VEGF], glioblastomas [endostatin, angiostatin and/or RNAi against VEGF]), liver, kidney, heart including congestive heart failure or peripheral artery disease (PAD) (e.g., by delivering protein phosphatase inhibitor I (I-1), serca2a, zinc finger proteins that regulate the phospholamban gene, Barkct, β2-adrenergic receptor, β2-adrenergic receptor kinase (BARK), phosphoinositide-3 kinase (PI3 kinase), S100A1, parvalbumin, adenylyl cyclase type 6, a molecule that effects G-protein coupled receptor kinase type 2 knockdown such as a truncated constitutively active bARKct; calsarcin, RNAi against phospholamban; phospholamban inhibitory or dominant-negative molecules such as phospholamban S16E, etc.), arthritis (insulin-like growth factors), joint disorders (insulin-like growth factor 1 and/or 2), intimal hyperplasia (e.g., by delivering enos, inos), improve survival of heart transplants (superoxide dismutase), AIDS (soluble CD4), muscle wasting (insulin-like growth factor I), kidney deficiency (erythropoietin), anemia (erythropoietin), arthritis (anti-inflammatory factors such as IRAP and TNFα soluble receptor), hepatitis (α-interferon), LDL receptor deficiency (LDL receptor), hyperammonemia (ornithine transcarbamylase), Krabbe's disease (galactocerebrosidase), Batten's disease, spinal cerebral ataxias including SCA1, SCA2 and SCA3, phenylketonuria (phenylalanine hydroxylase), autoimmune diseases, and the like. The invention can further be used following organ transplantation to increase the success of the transplant and/or to reduce the negative side effects of organ transplantation or adjunct therapies (e.g., by administering immunosuppressant agents or inhibitory nucleic acids to block cytokine production). As another example, bone morphogenic proteins (including BNP 2, 7, etc., RANKL and/or VEGF) can be administered with a bone allograft, for example, following a break or surgical removal in a cancer patient.

Gene transfer has substantial potential use for understanding and providing therapy for disease states. There are a number of inherited diseases in which defective genes are known and have been cloned. In general, the above disease states fall into two classes: deficiency states, usually of enzymes, which are generally inherited in a recessive manner, and unbalanced states, which may involve regulatory or structural proteins, and which are typically inherited in a dominant manner. For deficiency state diseases, gene transfer can be used to bring a normal gene into affected tissues for replacement therapy, as well as to create animal models for the disease using antisense mutations. For unbalanced disease states, gene transfer can be used to create a disease state in a model system, which can then be used in efforts to counteract the disease state. Thus, virus vectors according to the present invention permit the treatment and/or prevention of genetic diseases.

The virus vectors according to the present invention may also be employed to provide a functional RNA to a cell in vitro or in vivo. Expression of the functional RNA in the cell, for example, can diminish expression of a particular target protein by the cell. Accordingly, functional RNA can be administered to decrease expression of a particular protein in a subject in need thereof. Functional RNA can also be administered to cells in vitro to regulate gene expression and/or cell physiology, e.g., to optimize cell or tissue culture systems or in screening methods.

Virus vectors according to the instant invention find use in diagnostic and screening methods, whereby a nucleic acid of interest is transiently or stably expressed in a cell culture system, or alternatively, a transgenic animal model.

The virus vectors of the present invention can also be used for various non-therapeutic purposes, including but not limited to use in protocols to assess gene targeting, clearance, transcription, translation, etc., as would be apparent to one skilled in the art. The virus vectors can also be used for the purpose of evaluating safety (spread, toxicity, immunogenicity, etc.). Such data, for example, are considered by the United States Food and Drug Administration as part of the regulatory approval process prior to evaluation of clinical efficacy.

As a further aspect, the virus vectors of the present invention may be used to produce an immune response in a subject. According to this embodiment, a virus vector comprising a heterologous nucleic acid sequence encoding an immunogenic polypeptide can be administered to a subject, and an active immune response is mounted by the subject against the immunogenic polypeptide. Immunogenic polypeptides are as described hereinabove. In some embodiments, a protective immune response is elicited.

Alternatively, the virus vector may be administered to a cell ex vivo and the altered cell is administered to the subject. The virus vector comprising the heterologous nucleic acid is introduced into the cell, and the cell is administered to the subject, where the heterologous nucleic acid encoding the immunogen can be expressed and induce an immune response in the subject against the immunogen. In particular embodiments, the cell is an antigen-presenting cell (e.g., a dendritic cell).

An “active immune response” or “active immunity” is characterized by “participation of host tissues and cells after an encounter with the immunogen. It involves differentiation and proliferation of immunocompetent cells in lymphoreticular tissues, which lead to synthesis of antibody or the development of cell-mediated reactivity, or both.” Herbert B. Herscowitz, Immunophysiology: Cell Function and Cellular Interactions in Antibody Formation, in IMMUNOLOGY: BASIC PROCESSES 117 (Joseph A. Bellanti ed., 1985). Alternatively stated, an active immune response is mounted by the host after exposure to an immunogen by infection or by vaccination. Active immunity can be contrasted with passive immunity, which is acquired through the “transfer of preformed substances (antibody, transfer factor, thymic graft, interleukin-2) from an actively immunized host to a non-immune host.” Id.

A “protective” immune response or “protective” immunity as used herein indicates that the immune response confers some benefit to the subject in that it prevents or reduces the incidence of disease. Alternatively, a protective immune response or protective immunity may be useful in the treatment and/or prevention of disease, in particular cancer or tumors (e.g., by preventing cancer or tumor formation, by causing regression of a cancer or tumor and/or by preventing metastasis and/or by preventing growth of metastatic nodules). The protective effects may be complete or partial, as long as the benefits of the treatment outweigh any disadvantages thereof.

In particular embodiments, the virus vector or cell comprising the heterologous nucleic acid can be administered in an immunogenically effective amount, as described below.

The virus vectors of the present invention can also be administered for cancer immunotherapy by administration of a virus vector expressing one or more cancer cell antigens (or an immunologically similar molecule) or any other immunogen that produces an immune response against a cancer cell. To illustrate, an immune response can be produced against a cancer cell antigen in a subject by administering a virus vector comprising a heterologous nucleic acid encoding the cancer cell antigen, for example to treat a patient with cancer and/or to prevent cancer from developing in the subject. The virus vector may be administered to a subject in vivo or by using ex vivo methods, as described herein. Alternatively, the cancer antigen can be expressed as part of the virus capsid or be otherwise associated with the virus capsid as described above.

As another alternative, any other therapeutic nucleic acid (e.g., RNAi) or polypeptide (e.g., cytokine) known in the art can be administered to treat and/or prevent cancer.

As used herein, the term “cancer” encompasses tumor-forming cancers. Likewise, the term “cancerous tissue” encompasses tumors. A “cancer cell antigen” encompasses tumor antigens.

The term “cancer” has its understood meaning in the art, for example, an uncontrolled growth of tissue that has the potential to spread to distant sites of the body (i.e., metastasize). Exemplary cancers include, but are not limited to melanoma, adenocarcinoma, thymoma, lymphoma (e.g., non-Hodgkin's lymphoma, Hodgkin's lymphoma), sarcoma, lung cancer, liver cancer, colon cancer, leukemia, uterine cancer, breast cancer, prostate cancer, ovarian cancer, cervical cancer, bladder cancer, kidney cancer, pancreatic cancer, brain cancer and any other cancer or malignant condition now known or later identified. In representative embodiments, the invention provides a method of treating and/or preventing tumor-forming cancers.

The term “tumor” is also understood in the art, for example, as an abnormal mass of undifferentiated cells within a multicellular organism. Tumors can be malignant or benign. In representative embodiments, the methods disclosed herein are used to prevent and treat malignant tumors.

By the terms “treating cancer,” “treatment of cancer” and equivalent terms it is intended that the severity of the cancer is reduced or at least partially eliminated and/or the progression of the disease is slowed and/or controlled and/or the disease is stabilized. In particular embodiments, these terms indicate that metastasis of the cancer is prevented or reduced or at least partially eliminated and/or that growth of metastatic nodules is prevented or reduced or at least partially eliminated.

By the terms “prevention of cancer” or “preventing cancer” and equivalent terms it is intended that the methods at least partially eliminate or reduce and/or delay the incidence and/or severity of the onset of cancer. Alternatively stated, the onset of cancer in the subject may be reduced in likelihood or probability and/or delayed.

In particular embodiments, cells may be removed from a subject with cancer and contacted with a virus vector according to the instant invention. The modified cell is then administered to the subject, whereby an immune response against the cancer cell antigen is elicited. This method can be advantageously employed with immunocompromised subjects that cannot mount a sufficient immune response in vivo (i.e., cannot produce enhancing antibodies in sufficient quantities).

It is known in the art that immune responses may be enhanced by immunomodulatory cytokines (e.g., α-interferon, β-interferon, γ-interferon, ω-interferon, τ-interferon, interleukin-1α, interleukin-1β, interleukin-2, interleukin-3, interleukin-4, interleukin 5, interleukin-6, interleukin-7, interleukin-8, interleukin-9, interleukin-10, interleukin-11, interleukin 12, interleukin-13, interleukin-14, interleukin-18, B cell Growth factor, CD40 Ligand, tumor necrosis factor-α, tumor necrosis factor-β, monocyte chemoattractant protein-1, granulocyte-macrophage colony stimulating factor, and lymphotoxin). Accordingly, immunomodulatory cytokines (preferably, CTL inductive cytokines) may be administered to a subject in conjunction with the virus vector.

Cytokines may be administered by any method known in the art. Exogenous cytokines may be administered to the subject, or alternatively, a nucleic acid encoding a cytokine may be delivered to the subject using a suitable vector, and the cytokine produced in vivo.

Subjects, Pharmaceutical Formulations, and Modes of Administration

Virus vectors and capsids according to the present invention find use in both veterinary and medical applications. Suitable subjects include both avians and mammals. The term “avian” as used herein includes, but is not limited to, chickens, ducks, geese, quail, turkeys, pheasant, parrots, parakeets, and the like. The term “mammal” as used herein includes, but is not limited to, humans, non-human primates, bovines, ovines, caprines, equines, felines, canines, lagomorphs, etc. Human subjects include neonates, infants, juveniles and adults.

In particular embodiments, the present invention provides a pharmaceutical composition comprising a virus vector and/or capsid of the invention in a pharmaceutically acceptable carrier and, optionally, other medicinal agents, pharmaceutical agents, stabilizing agents, buffers, carriers, adjuvants, diluents, etc. For injection, the carrier will typically be a liquid. For other methods of administration, the carrier may be either solid or liquid. For inhalation administration, the carrier will be respirable, and optionally can be in solid or liquid particulate form.

By “pharmaceutically acceptable” it is meant a material that is not toxic or otherwise undesirable, i.e., the material may be administered to a subject without causing any undesirable biological effects.

One aspect of the present invention is a method of transferring a nucleic acid to a cell in vitro. The virus vector may be introduced into the cells at the appropriate multiplicity of infection according to standard transduction methods suitable for the particular target cells. Titers of virus vector to administer can vary, depending upon the target cell type and number, and the particular virus vector, and can be determined by those of skill in the art without undue experimentation. In representative embodiments, at least about 10³ infectious units, more preferably at least about 10⁵ infectious units are introduced to the cell.

The cell(s) into which the virus vector is introduced can be of any type, including but not limited to neural cells (including cells of the peripheral and central nervous systems, in particular, brain cells such as neurons and oligodendrocytes), lung cells, cells of the eye (including retinal cells, retinal pigment epithelium, and corneal cells), blood vessel cells (e.g., endothelial cells, intimal cells), epithelial cells (e.g., gut and respiratory epithelial cells), muscle cells (e.g., skeletal muscle cells, cardiac muscle cells, smooth muscle cells and/or diaphragm muscle cells), dendritic cells, pancreatic cells (including islet cells), hepatic cells, kidney cells, myocardial cells, bone cells (e.g., bone marrow stem cells), hematopoietic stem cells, spleen cells, keratinocytes, fibroblasts, endothelial cells, prostate cells, germ cells, and the like. In representative embodiments, the cell can be any progenitor cell. As a further possibility, the cell can be a stem cell (e.g., neural stem cell, liver stem cell). As still a further alternative, the cell can be a cancer or tumor cell. Moreover, the cell can be from any species of origin, as indicated above.

The virus vector can be introduced into cells in vitro for the purpose of administering the modified cell to a subject. In particular embodiments, the cells have been removed from a subject, the virus vector is introduced therein, and the cells are then administered back into the subject. Methods of removing cells from subject for manipulation ex vivo, followed by introduction back into the subject are known in the art (see, e.g., U.S. Pat. No. 5,399,346). Alternatively, the recombinant virus vector can be introduced into cells from a donor subject, into cultured cells, or into cells from any other suitable source, and the cells are administered to a subject in need thereof (i.e., a “recipient” subject).

Suitable cells for ex vivo gene delivery are as described above. Dosages of the cells to administer to a subject will vary upon the age, condition and species of the subject, the type of cell, the nucleic acid being expressed by the cell, the mode of administration, and the like. Typically, at least about 10² to about 10⁸ cells or at least about 10³ to about 10⁶ cells will be administered per dose in a pharmaceutically acceptable carrier. In particular embodiments, the cells transduced with the virus vector are administered to the subject in a treatment effective or prevention effective amount in combination with a pharmaceutical carrier.

In some embodiments, the virus vector is introduced into a cell and the cell can be administered to a subject to elicit an immunogenic response against the delivered polypeptide (e.g., expressed as a transgene or in the capsid). Typically, a quantity of cells expressing an immunogenically effective amount of the polypeptide in combination with a pharmaceutically acceptable carrier is administered. An “immunogenically effective amount” is an amount of the expressed polypeptide that is sufficient to evoke an active immune response against the polypeptide in the subject to which the pharmaceutical formulation is administered. In particular embodiments, the dosage is sufficient to produce a protective immune response (as defined above). The degree of protection conferred need not be complete or permanent, as long as the benefits of administering the immunogenic polypeptide outweigh any disadvantages thereof.

A further aspect of the invention is a method of administering the virus vector to subjects. Administration of the virus vectors and/or capsids according to the present invention to a human subject or an animal in need thereof can be by any means known in the art. Optionally, the virus vector and/or capsid is delivered in a treatment effective or prevention effective dose in a pharmaceutically acceptable carrier.

The virus vectors and/or capsids of the invention can further be administered to elicit an immunogenic response (e.g., as a vaccine). Typically, immunogenic compositions of the present invention comprise an immunogenically effective amount of virus vector and/or capsid in combination with a pharmaceutically acceptable carrier. Optionally, the dosage is sufficient to produce a protective immune response (as defined above). The degree of protection conferred need not be complete or permanent, as long as the benefits of administering the immunogenic polypeptide outweigh any disadvantages thereof. Subjects and immunogens are as described above.

Dosages of the virus vector and/or capsid to be administered to a subject depend upon the mode of administration, the disease or condition to be treated and/or prevented, the individual subject's condition, the particular virus vector or capsid, and the nucleic acid to be delivered, and the like, and can be determined in a routine manner. Exemplary doses for achieving therapeutic effects are titers of at least about 10⁵, 10⁶, 10⁷, 10⁸, 10⁹, 10¹⁰, 10¹¹, 10¹², 10¹³, 10¹⁴, 10¹⁵ transducing units, optionally about 10⁸-10¹³ transducing units.

In particular embodiments, more than one administration (e.g., two, three, four or more administrations) may be employed to achieve the desired level of gene expression over a period of various intervals, e.g., daily, weekly, monthly, yearly, etc.

Exemplary modes of administration include oral, rectal, transmucosal, intranasal, inhalation (e.g., via an aerosol), buccal (e.g., sublingual), vaginal, intrathecal, intraocular, transdermal, intraendothelial, in utero (or in ovo), parenteral (e.g., intravenous, subcutaneous, intradermal, intracranial, intramuscular [including administration to skeletal, diaphragm and/or cardiac muscle], intrapleural, intracerebral, and intraarticular), topical (e.g., to both skin and mucosal surfaces, including airway surfaces, and transdermal administration), intralymphatic, and the like, as well as direct tissue or organ injection (e.g., to liver, eye, skeletal muscle, cardiac muscle, diaphragm muscle or brain).

Administration can be to any site in a subject, including, without limitation, a site selected from the group consisting of the brain, a skeletal muscle, a smooth muscle, the heart, the diaphragm, the airway epithelium, the liver, the kidney, the spleen, the pancreas, the skin, and the eye.

Administration can also be to a tumor (e.g., in or near a tumor or a lymph node). The most suitable route in any given case will depend on the nature and severity of the condition being treated and/or prevented and on the nature of the particular vector that is being used.

Administration to skeletal muscle according to the present invention includes but is not limited to administration to skeletal muscle in the limbs (e.g., upper arm, lower arm, upper leg, and/or lower leg), back, neck, head (e.g., tongue), thorax, abdomen, pelvis/perineum, and/or digits. Suitable skeletal muscles include but are not limited to abductor digiti minimi (in the hand), abductor digiti minimi (in the foot), abductor hallucis, abductor ossis metatarsi quinti, abductor pollicis brevis, abductor pollicis longus, adductor brevis, adductor hallucis, adductor longus, adductor magnus, adductor pollicis, anconeus, anterior scalene, articularis genus, biceps brachii, biceps femoris, brachialis, brachioradialis, buccinator, coracobrachialis, corrugator supercilii, deltoid, depressor anguli oris, depressor labii inferioris, digastric, dorsal interossei (in the hand), dorsal interossei (in the foot), extensor carpi radialis brevis, extensor carpi radialis longus, extensor carpi ulnaris, extensor digiti minimi, extensor digitorum, extensor digitorum brevis, extensor digitorum longus, extensor hallucis brevis, extensor hallucis longus, extensor indicis, extensor pollicis brevis, extensor pollicis longus, flexor carpi radialis, flexor carpi ulnaris, flexor digiti minimi brevis (in the hand), flexor digiti minimi brevis (in the foot), flexor digitorum brevis, flexor digitorum longus, flexor digitorum profundus, flexor digitorum superficialis, flexor hallucis brevis, flexor hallucis longus, flexor pollicis brevis, flexor pollicis longus, frontalis, gastrocnemius, geniohyoid, gluteus maximus, gluteus medius, gluteus minimus, gracilis, iliocostalis cervicis, iliocostalis lumborum, iliocostalis thoracis, illiacus, inferior gemellus, inferior oblique, inferior rectus, infraspinatus, interspinalis, intertransversi, lateral pterygoid, lateral rectus, latissimus dorsi, levator anguli oris, levator labii superioris, levator labii superioris alaeque nasi, levator palpebrae superioris, levator scapulae, long rotators, longissimus capitis, longissimus cervicis, longissimus thoracis, longus capitis, longus colli, lumbricals (in the hand), lumbricals (in the foot), masseter, medial pterygoid, medial rectus, middle scalene, multifidus, mylohyoid, obliquus capitis inferior, obliquus capitis superior, obturator externus, obturator internus, occipitalis, omohyoid, opponens digiti minimi, opponens pollicis, orbicularis oculi, orbicularis oris, palmar interossei, palmaris brevis, palmaris longus, pectineus, pectoralis major, pectoralis minor, peroneus brevis, peroneus longus, peroneus tertius, piriformis, plantar interossei, plantaris, platysma, popliteus, posterior scalene, pronator quadratus, pronator teres, psoas major, quadratus femoris, quadratus plantae, rectus capitis anterior, rectus capitis lateralis, rectus capitis posterior major, rectus capitis posterior minor, rectus femoris, rhomboid major, rhomboid minor, risorius, sartorius, scalenus minimus, semimembranosus, semispinalis capitis, semispinalis cervicis, semispinalis thoracis, semitendinosus, serratus anterior, short rotators, soleus, spinalis capitis, spinalis cervicis, spinalis thoracis, splenius capitis, splenius cervicis, sternocleidomastoid, sternohyoid, sternothyroid, stylohyoid, subclavius, subscapularis, superior gemellus, superior oblique, superior rectus, supinator, supraspinatus, temporalis, tensor fascia lata, teres major, teres minor, thoracis, thyrohyoid, tibialis anterior, tibialis posterior, trapezius, triceps brachii, vastus intermedius, vastus lateralis, vastus medialis, zygomaticus major, and zygomaticus minor, and any other suitable skeletal muscle as known in the art.

The virus vector can be delivered to skeletal muscle by intravenous administration, intra-arterial administration, intraperitoneal administration, limb perfusion, (optionally, isolated limb perfusion of a leg and/or arm; see, e.g. Arruda et al., (2005) Blood 105: 3458-3464), and/or direct intramuscular injection. In particular embodiments, the virus vector and/or capsid is administered to a limb (arm and/or leg) of a subject (e.g., a subject with muscular dystrophy such as DMD) by limb perfusion, optionally isolated limb perfusion (e.g., by intravenous or intra-articular administration. In embodiments of the invention, the virus vectors and/or capsids of the invention can advantageously be administered without employing “hydrodynamic” techniques. Tissue delivery (e.g., to muscle) of prior art vectors is often enhanced by hydrodynamic techniques (e.g., intravenous/intravenous administration in a large volume), which increase pressure in the vasculature and facilitate the ability of the vector to cross the endothelial cell barrier. In particular embodiments, the viral vectors and/or capsids of the invention can be administered in the absence of hydrodynamic techniques such as high volume infusions and/or elevated intravascular pressure (e.g., greater than normal systolic pressure, for example, less than or equal to a 5%, 10%, 15%, 20%, 25% increase in intravascular pressure over normal systolic pressure). Such methods may reduce or avoid the side effects associated with hydrodynamic techniques such as edema, nerve damage and/or compartment syndrome.

Administration to cardiac muscle includes administration to the left atrium, right atrium, left ventricle, right ventricle and/or septum. The virus vector and/or capsid can be delivered to cardiac muscle by intravenous administration, intra-arterial administration such as intra-aortic administration, direct cardiac injection (e.g., into left atrium, right atrium, left ventricle, right ventricle), and/or coronary artery perfusion.

Administration to diaphragm muscle can be by any suitable method including intravenous administration, intra-arterial administration, and/or intra-peritoneal administration.

Administration to smooth muscle can be by any suitable method including intravenous administration, intra-arterial administration, and/or intra-peritoneal administration. In one embodiment, administration can be to endothelial cells present in, near, and/or on smooth muscle.

Delivery to a target tissue can also be achieved by delivering a depot comprising the virus vector and/or capsid. In representative embodiments, a depot comprising the virus vector and/or capsid is implanted into skeletal, smooth, cardiac and/or diaphragm muscle tissue or the tissue can be contacted with a film or other matrix comprising the virus vector and/or capsid. Such implantable matrices or substrates are described in U.S. Pat. No. 7,201,898.

In particular embodiments, a virus vector according to the present invention is administered to skeletal muscle, diaphragm muscle and/or cardiac muscle (e.g., to treat and/or prevent muscular dystrophy or heart disease [for example, PAD or congestive heart failure]).

In representative embodiments, the invention is used to treat and/or prevent disorders of skeletal, cardiac and/or diaphragm muscle.

In a representative embodiment, the invention provides a method of treating and/or preventing muscular dystrophy in a subject in need thereof, the method comprising: administering a treatment or prevention effective amount of a virus vector of the invention to a mammalian subject, wherein the virus vector comprises a heterologous nucleic acid encoding dystrophin, a mini-dystrophin, a micro-dystrophin, myostatin propeptide, follistatin, activin type II soluble receptor, IGF-1, anti-inflammatory polypeptides such as the Ikappa B dominant mutant, sarcospan, utrophin, a micro-dystrophin, laminin-α2, α-sarcoglycan, β-sarcoglycan, γ-sarcoglycan, δ-sarcoglycan, IGF-1, an antibody or antibody fragment against myostatin or myostatin propeptide, and/or RNAi against myostatin. In particular embodiments, the virus vector can be administered to skeletal, diaphragm and/or cardiac muscle as described elsewhere herein.

Alternatively, the invention can be practiced to deliver a nucleic acid to skeletal, cardiac or diaphragm muscle, which is used as a platform for production of a polypeptide (e.g., an enzyme) or functional RNA (e.g., RNAi, microRNA, antisense RNA) that normally circulates in the blood or for systemic delivery to other tissues to treat and/or prevent a disorder (e.g., a metabolic disorder, such as diabetes (e.g., insulin), hemophilia (e.g., Factor IX or Factor VIII), a mucopolysaccharide disorder (e.g., Sly syndrome, Hurler Syndrome, Scheie Syndrome, Hurler-Scheie Syndrome, Hunter's Syndrome, Sanfilippo Syndrome A, B, C, D, Morquio Syndrome, Maroteaux-Lamy Syndrome, etc.) or a lysosomal storage disorder (such as Gaucher's disease [glucocerebrosidase], Pompe disease [lysosomal acid α-glucosidase] or Fabry disease [α-galactosidase A]) or a glycogen storage disorder (such as Pompe disease [lysosomal acid α glucosidase]). Other suitable proteins for treating and/or preventing metabolic disorders are described above. The use of muscle as a platform to express a nucleic acid of interest is described in U.S. Patent Publication No. 2002/0192189.

Thus, as one aspect, the invention further encompasses a method of treating and/or preventing a metabolic disorder in a subject in need thereof, the method comprising: administering a treatment or prevention effective amount of a virus vector of the invention to a subject (e.g., to skeletal muscle of a subject), wherein the virus vector comprises a heterologous nucleic acid encoding a polypeptide, wherein the metabolic disorder is a result of a deficiency and/or defect in the polypeptide. Illustrative metabolic disorders and heterologous nucleic acids encoding polypeptides are described herein. Optionally, the polypeptide is secreted (e.g., a polypeptide that is a secreted polypeptide in its native state or that has been engineered to be secreted, for example, by operable association with a secretory signal sequence as is known in the art). Without being limited by any particular theory of the invention, according to this embodiment, administration to the skeletal muscle can result in secretion of the polypeptide into the systemic circulation and delivery to target tissue(s). Methods of delivering virus vectors to skeletal muscle are described in more detail herein.

The invention can also be practiced to produce antisense RNA, RNAi or other functional RNA (e.g., a ribozyme) for systemic delivery.

The invention also provides a method of treating and/or preventing congenital heart failure or PAD in a subject in need thereof, the method comprising administering a treatment or prevention effective amount of a virus vector of the invention to a mammalian subject, wherein the virus vector comprises a heterologous nucleic acid encoding, for example, a sarcoplasmic endoreticulum Ca²⁺-ATPase (SERCA2a), an angiogenic factor, phosphatase inhibitor I (I-1), RNAi against phospholamban; a phospholamban inhibitory or dominant-negative molecule such as phospholamban S16E, a zinc finger protein that regulates the phospholamban gene, β2-adrenergic receptor, β2-adrenergic receptor kinase (BARK), PI3 kinase, calsarcan, a β-adrenergic receptor kinase inhibitor (βARKct), inhibitor 1 of protein phosphatase 1, S100A1, parvalbumin, adenylyl cyclase type 6, a molecule that effects G-protein coupled receptor kinase type 2 knockdown such as a truncated constitutively active bARKct, Pim-1, PGC-1α, SOD-1, SOD-2, EC-SOD, kallikrein, HIF, thymosin-β4, mir-1, mir-133, mir-206 and/or mir-208.

Injectables can be prepared in conventional forms, either as liquid solutions or suspensions, solid forms suitable for solution or suspension in liquid prior to injection, or as emulsions. Alternatively, one may administer the virus vector and/or virus capsids of the invention in a local rather than systemic manner, for example, in a depot or sustained-release formulation. Further, the virus vector and/or virus capsid can be delivered adhered to a surgically implantable matrix (e.g., as described in U.S. Patent Publication No. 2004-0013645).

The virus vectors disclosed herein can be administered to the lungs of a subject by any suitable means, optionally by administering an aerosol suspension of respirable particles comprised of the virus vectors and/or virus capsids, which the subject inhales. The respirable particles can be liquid or solid. Aerosols of liquid particles comprising the virus vectors and/or virus capsids may be produced by any suitable means, such as with a pressure-driven aerosol nebulizer or an ultrasonic nebulizer, as is known to those of skill in the art. See, e.g., U.S. Pat. No. 4,501,729. Aerosols of solid particles comprising the virus vectors and/or capsids may likewise be produced with any solid particulate medicament aerosol generator, by techniques known in the pharmaceutical art.

The virus vectors can be administered to tissues of the CNS (e.g., brain, eye) and may advantageously result in broader distribution of the virus vector or capsid than would be observed in the absence of the present invention.

In particular embodiments, the delivery vectors of the invention may be administered to treat diseases of the CNS, including genetic disorders, neurodegenerative disorders, psychiatric disorders and tumors. Illustrative diseases of the CNS include, but are not limited to Alzheimer's disease, Parkinson's disease, Huntington's disease, Canavan disease, Leigh's disease, Refsum disease, Tourette syndrome, primary lateral sclerosis, amyotrophic lateral sclerosis, progressive muscular atrophy, Pick's disease, muscular dystrophy, multiple sclerosis, myasthenia gravis, Binswanger's disease, trauma due to spinal cord or head injury, Tay Sachs disease, Lesch-Nyan disease, epilepsy, cerebral infarcts, psychiatric disorders including mood disorders (e.g., depression, bipolar affective disorder, persistent affective disorder, secondary mood disorder), schizophrenia, drug dependency (e.g., alcoholism and other substance dependencies), neuroses (e.g., anxiety, obsessional disorder, somatoform disorder, dissociative disorder, grief, post-partum depression), psychosis (e.g., hallucinations and delusions), dementia, paranoia, attention deficit disorder, psychosexual disorders, sleeping disorders, pain disorders, eating or weight disorders (e.g., obesity, cachexia, anorexia nervosa, and bulemia) and cancers and tumors (e.g., pituitary tumors) of the CNS.

Disorders of the CNS include ophthalmic disorders involving the retina, posterior tract, and optic nerve (e.g., retinitis pigmentosa, diabetic retinopathy and other retinal degenerative diseases, uveitis, age-related macular degeneration, glaucoma).

Most, if not all, ophthalmic diseases and disorders are associated with one or more of three types of indications: (1) angiogenesis, (2) inflammation, and (3) degeneration. The delivery vectors of the present invention can be employed to deliver anti-angiogenic factors; anti-inflammatory factors; factors that retard cell degeneration, promote cell sparing, or promote cell growth and combinations of the foregoing.

Diabetic retinopathy, for example, is characterized by angiogenesis. Diabetic retinopathy can be treated by delivering one or more anti-angiogenic factors either intraocularly (e.g., in the vitreous) or periocularly (e.g., in the sub-Tenon's region). One or more neurotrophic factors may also be co-delivered, either intraocularly (e.g., intravitreally) or periocularly.

Uveitis involves inflammation. One or more anti-inflammatory factors can be administered by intraocular (e.g., vitreous or anterior chamber) administration of a delivery vector of the invention.

Retinitis pigmentosa, by comparison, is characterized by retinal degeneration. In representative embodiments, retinitis pigmentosa can be treated by intraocular (e.g., vitreal administration) of a delivery vector encoding one or more neurotrophic factors.

Age-related macular degeneration involves both angiogenesis and retinal degeneration. This disorder can be treated by administering the inventive delivery vectors encoding one or more neurotrophic factors intraocularly (e.g., vitreous) and/or one or more anti-angiogenic factors intraocularly or periocularly (e.g., in the sub-Tenon's region).

Glaucoma is characterized by increased ocular pressure and loss of retinal ganglion cells. Treatments for glaucoma include administration of one or more neuroprotective agents that protect cells from excitotoxic damage using the inventive delivery vectors. Such agents include N-methyl-D-aspartate (NMDA) antagonists, cytokines, and neurotrophic factors, delivered intraocularly, optionally intravitreally.

In other embodiments, the present invention may be used to treat seizures, e.g., to reduce the onset, incidence or severity of seizures. The efficacy of a therapeutic treatment for seizures can be assessed by behavioral (e.g., shaking, ticks of the eye or mouth) and/or electrographic means (most seizures have signature electrographic abnormalities). Thus, the invention can also be used to treat epilepsy, which is marked by multiple seizures over time.

In one representative embodiment, somatostatin (or an active fragment thereof) is administered to the brain using a delivery vector of the invention to treat a pituitary tumor. According to this embodiment, the delivery vector encoding somatostatin (or an active fragment thereof) is administered by microinfusion into the pituitary. Likewise, such treatment can be used to treat acromegaly (abnormal growth hormone secretion from the pituitary). The nucleic acid (e.g., GenBank Accession No. J00306) and amino acid (e.g., GenBank Accession No. P01166; contains processed active peptides somatostatin-28 and somatostatin-14) sequences of somatostatins as are known in the art.

In particular embodiments, the vector can comprise a secretory signal as described in U.S. Pat. No. 7,071,172.

In representative embodiments of the invention, the virus vector and/or virus capsid is administered to the CNS (e.g., to the brain or to the eye). The virus vector and/or capsid may be introduced into the spinal cord, brainstem (medulla oblongata, pons), midbrain (hypothalamus, thalamus, epithalamus, pituitary gland, substantia nigra, pineal gland), cerebellum, telencephalon (corpus striatum, cerebrum including the occipital, temporal, parietal and frontal lobes, cortex, basal ganglia, hippocampus and portaamygdala), limbic system, neocortex, corpus striatum, cerebrum, and inferior colliculus. The virus vector and/or capsid may also be administered to different regions of the eye such as the retina, cornea and/or optic nerve.

The virus vector and/or capsid may be delivered into the cerebrospinal fluid (e.g., by lumbar puncture) for more disperse administration of the delivery vector. The virus vector and/or capsid may further be administered intravascularly to the CNS in situations in which the blood-brain barrier has been perturbed (e.g., brain tumor or cerebral infarct).

The virus vector and/or capsid can be administered to the desired region(s) of the CNS by any route known in the art, including but not limited to, intrathecal, intra-ocular, intracerebral, intraventricular, intravenous (e.g., in the presence of a sugar such as mannitol), intranasal, intra-aural, intra-ocular (e.g., intra-vitreous, sub-retinal, anterior chamber) and peri-ocular (e.g., sub-Tenon's region) delivery as well as intramuscular delivery with retrograde delivery to motor neurons.

In particular embodiments, the virus vector and/or capsid is administered in a liquid formulation by direct injection (e.g., stereotactic injection) to the desired region or compartment in the CNS. In other embodiments, the virus vector and/or capsid may be provided by topical application to the desired region or by intra-nasal administration of an aerosol formulation. Administration to the eye, may be by topical application of liquid droplets. As a further alternative, the virus vector and/or capsid may be administered as a solid, slow-release formulation (see, e.g., U.S. Pat. No. 7,201,898).

In yet additional embodiments, the virus vector can used for retrograde transport to treat and/or prevent diseases and disorders involving motor neurons (e.g., amyotrophic lateral sclerosis (ALS); spinal muscular atrophy (SMA), etc.). For example, the virus vector can be delivered to muscle tissue from which it can migrate into neurons.

Having described the present invention, the same will be explained in greater detail in the following examples, which are included herein for illustration purposes only, and which are not intended to be limiting to the invention.

Example 1 Materials and Methods

Rep Cloning—

pXR2 (Rep2Cap2) and pRep5Cap2 AAV helper plasmids served as templates for Rep cloning. The primer sequences used are indicated in Table 4. Two cloning strategies were used. Existing restriction sites were incorporated into primers for PCR (PCR-RD in Table 4) utilizing either pXR out fw or pXR out rev primers. PfuTurbo DNA Polymerase (Stratagene, La Jolla, Calif.) was used at the manufacturer's recommendations for all PCR reactions. PCR-RD products were digested with the enzymes indicated in Table 4 (NEB, Ipswich, Mass.) prior to ligation with T4 DNA Ligase (Invitrogen, Carlsbad, Calif.) according to the manufacturer's instructions. Alternately, an overlap-extension mediated PCR (OE-PCR) approach was used to produce Rep chimeras (Higuchi et al. (1988) Nucleic Acids Res. 16:7351). The Rep2 and Rep5 junction was incorporated into forward and reverse primers which were used in separate PCR reactions with the pXR out fw and rev primers (Table 4, only fw oligos indicated, rev oligos complimentary to fw). These overlapping PCR products were combined into a single PCR reaction and cycled as follows: 1 cycle at 94° C. for 30 seconds, 18 cycles of 30 seconds at 94° C., 30 seconds at 65° C., and 4 minutes at 72° C., 1 cycle of 10 minutes at 72° C. 1 μl of this reaction was used as template for a nested PCR with the pXR in fw and rev primers. Chimeras with the N-terminus of Rep2 and C-terminus of Rep5 were cloned into the Rep25aa166 construct between the PpuMI and MfeI sites. Chimeras with the N-terminus of Rep5 and C-terminus of Rep2 were cloned into the 52aa160 construct between the PpuMI and BstBI sites. All constructs were verified by DNA sequencing at the UNC-CH Genome Analysis Facility.

TABLE 4  Cloning SEQ Clone/Primer Method Orientation Sequence ID NO pXR out fw Forward 5′ CGAAAAGTGCCACCTGACGTCTAAGAAACC 126 pXR in fw Forward 5′ TCGAATTCGACGGCCAGTGAATTGTAATACGACTC 127 pXR out rev Reverse 5′ CCATGATTACGCCAAGCTCGGAATTAACCGCATGCGA 128 pXR in rev Reverse 5′ CCATGGCCGGGCCCGGATTCACC 129 Rep52aa84 PCR-RD AleI Reverse 5′ TTCACCCCGGTGGTTTCCACGAGCACGTGCATGTGGAAGTAGCTCT 130 CTCCCTTTTCAAACTGCACAAAG Rep52aa110 PCR-RD EagI Forward 5′ CCTCGGCCGCTACGTGAGTCAGATTCGCGAAAAACTGATTCAGAG 131 Rep52aa126 OE PCR Forward 5′ GTGGTCTTCCAGGGAATTGAACCCACTTTGCCAAACTGGTTCGCGGTC 132 Rep52aa138 OE PCR Forward 5′ CTGGGTCGCCATCACCAAGGTAAAGAAGGGAGGCGGGAACAAGGTGGT 133 GGATGAG Rep52aa146 OE PCR Forward 5′ GCGGAGCCAATAAGGTGGTGGATGAGTGCTACATCCCCAATTACTTGCTC 134 Rep52aa160 PCR-RD Bpu10I Reverse 5′ ACTGGAGCTCAGGTTGGACCTTCGGCAGCAGGTAG 135 Rep52aa175 OE PCR Forward 5′ CGTGGACAAACCTGGACGAGTATAAATTGGCCTGTTTGAATCTCACGGAG 136 CGTAAAC Rep52aa187 OE PCR Forward 5′ CTGAATCTGGAGGAGCGCAAACGGTTGGTGGCGCAGCATCTGACGCAC 137 Rep52aa207 PCR-RD SgrAI Reverse 5′ GATCACCGGCGCATCCGAGAACTCACGCTGCGAAGC 138 Rep25aa77 OE PCR Forward 5′ TAAGGCCCCGGAGGCCCTTTTCTTTGTGCAGTTTGAAAAGGGATCTG 139 Rep25aa97 OE PCR Forward 5′ CCACATGCACGTGCTCGTGGAAACCTCCGGCATCTCTTCCATGGTCCTCG 140 Rep25aa116 PCR-RD NruI Forward 5′ TCAGATTCGCGAAAAACTGGTGAAAGTGGTCTTCCAGG 141 Rep25aa125 OE PCR Forward 5′ GAATTTACCGCGGGATCGAGCCG CAGATCAACGACTGGGTCGCCATC 142 Rep25aa141 OE PCR Forward 5′ GGTCACAAAGACCAGAAATGGCGCCGGCGGAGCCAATAAGGTGGTGGATT 143 CTGG Rep25aa149 OE PCR Forward 5′ GAGGCGGGAACAAGGTGGTGGATTCTGGGTATATTCCCGCCTACCTGC 144 Rep25aa166 PCR-RD Bpu10I Forward 5′ CCAGCCTGAGCTCCAGTGGGCGTGGACAAACCTG 145 Rep25aa187 OE PCR Forward 5′ GTTTGAATCTCACGGAGCGTAAACGGCTCGTCGCGCAGTTTCTGGCAG 146 Rep25aa216 PCR-RD SgrAI Forward 5′ ATGCGCCGGTGATCAAAAGCAAGACTTCCCAGAAATACATGG 147 ITR2 Half1 Kpn Forward 5′ ATTATAGGTACCAGGAACCCCTAGTGATG 148 ITR2 Half 1 Sfi Reverse 5′ TAATAGGGCCCAAAGGGCCGGG 149 ITR2 Half2 Sfi Forward 5′ TTAATAGGCCCTTTGGGCCGGG 150 ITR2 Half2 Hind Reverse 5′ TATAATAAGCTTAGGAACCCCTAGTGATGGAG 151 ITR5 Halt1 Kpn Forward 5′ ATTATAGGTACCTACAAAACCTCCTTGCTTGAG 152 ITR5 Half1 Sfi Reverse 5′ TTAATAGGCCCTTTGGGCCGTCGC 153 ITR5 Half2 Sfi Forward 5′ TTAATAGGCCCAAAGGGCCGTCGTC 154 ITR5 Half2 Hind Reverse 5′ TATAATAAGCTTTACAAAACCTCCTTGCTTGAGAG 155

ITR Cloning—

ITRs were cloned into a pUC-18 plasmid with a GFP cassette (CMV promoter, SV40 polyA) cloned between the KpnI and EcoRI restriction sites. The ITRs were synthesized in two halves as 4 nmol Ultramer DNA oligos (Integrated DNA Technologies, Coralville, Iowa). SfiI restriction sites were incorporated into one hairpin arm the ITR for cloning (FIG. 1A). Due to inconsistencies of the reported sequence at the tip of the ITR5 hairpins between Chiorini et al. (1999), the published GenBank sequence (accession number NC_006152), and restriction mapping, an ITR2 hairpin was utilized for the ITR5 construct (FIG. 1A). 200 pg of each oligo was amplified in a PCR reaction using the ITR primers listed in Table 4. 2.5 U of PfuTurbo DNA Polymerase (Stratagene, La Jolla, Calif.) was used to amplify each half of the ITR as follows: 1 cycle at 94° C. for 4 minutes, 35 cycles of 45 seconds at 94° C., 30 seconds at 50° C., and 30 seconds at 72° C., 1 cycle of 10 minutes at 72° C. PCR reactions were purified and subject to digestion by KpnI and SfiI or HindIII and SfiI (NEB, Ipswich, Mass.). A triple ligation with the pUC-18 GFP plasmid and each half of the ITR was performed with T4 DNA Ligase (Invitrogen, Carlsbad, Calif.) for 1.5 hours at room temperature. All constructs were verified by DNA sequencing at the UNC-CH Genome Analysis Facility after linearization of the plasmid and ablation of the ITR secondary structure by SfiI digestion.

Western Blot Analysis—

Samples for Western blot analysis were harvested 48-72 hours after transfection of Ad-helper plasmid and the appropriate AAV helper construct. Cells were washed and resuspended in 100 μl PBS prior to addition of 100 μl 2× Laemmli Sample Buffer (100 mM Tris pH 6.8, 4% SDS, 200 mM DTT, 20% glycerol, 0.1% Bromophenol Blue). Samples were briefly sonicated and boiled for 10 minutes. Samples were run on NUPAGE 4-12% Bis-Tris gels (Invitrogen, Carlsbad, Calif.) at 160 volts for 90 minutes. Protein was transferred to a Nitrocellulose membrane (Invitrogen, Carlsbad, Calif.) via a wet transfer for 60 minutes at 30 volts. Gels were blocked overnight in 10% nonfat dry milk in 1× PBS/Tween (0.05%). Detection of both Rep2 and Rep5 proteins (all four sizes) was achieved with a monoclonal Anti-Adeno-Associated Virus Rep Protein antibody (clone 259.5, American Research Products, Belmont, Mass.) at a 1:20 dilution in PBS/Tween for 60 minutes at room temperature. After washing, a secondary HRP anti-mouse antibody was added at a 1:5,000 dilution in PBS/Tween for one hour at room temperature. After washing, SuperSignal West Femto Maximum Sensitivity Substrate (Pierce, Rockford, Ill.) was added and blots were exposed to X-ray film.

Cell Culture and rAAV Production—

HEK 293 cells were obtained from ATCC and cultured in Dulbecco Modified Eagle Medium (DMEM) supplemented with 10% Fetal Bovine Serum (Sigma, St. Louis, Mo.) and 100 units/ml penicillin and 100 μg/ml streptomycin and grown at 37° C. with 5% CO₂ saturation. Transfections were performed in six-well cell culture plates. 0.75 μg each of Ad-helper plasmid, AAV helper plasmid (either Rep2Cap2, Rep5Cap2, or the Rep mutant described), and the GFP plasmid containing the ITR (mutant or wt ITR as specified in text) were triple-transfected with polyethyleneimine (PEI) (25,000 linear molecular weight) as described (Xiao et al. (1998) J. Virol. 72:2224). Cells were harvested 48-72 hours post-transfection.

Hirt DNA Purification and Southern Blot Analysis—

Hirt DNA purification was performed as described (Hirt (1967) J. Mol. Biol. 26:365). Cells were harvested 48-72 hours post-transfection, washed in PBS, and resuspended in 370 μl Hirt Solution (0.01M Tris-HCl pH 7.5 and 0.1M EDTA) prior to addition of 25 μl 10% SDS and 165 μl 5M NaCl. Samples were incubated at 4° C. overnight prior to centrifugation. DNA was purified by phenol chloroform extraction and precipitated by an equal volume of isopropanol prior to resuspension in 50 μl sterile ddH₂O. 5 ul of each sample was digested with 4 U DpnI (NEB, Ipswich, Mass.) 2-4 hours at 37° C. prior to gel electrophoresis and Southern blot analysis to remove non-replicated transfected plasmid (Chomczynski (1992) Anal. Biochem. 201:134). The nylon membrane (Hybond-XL; GE Healthcare Life Sciences, Piscataway, N.J.) was hybridized to a probe corresponding to the GFP open reading frame labeled with the Random Primed DNA Labeling Kit (Roche, Indianapolis, Ind.) and d-CTP P³². Blots were visualized after exposure to a phosphorimager screen (GE Healthcare Life Sciences, Piscataway, N.J.).

Densitometry—

Densitometry was performed using the public domain NIH Image program (developed at the U.S. National Institutes of Health available on the Internet at the NIH website). Densitometry analysis of a DpnI resistant band on the agarose gel prior to transfer was used as a loading control to normalize values obtained from the Southern blot. The lowest value (absence of any vector replication) was subtracted from all values to account for background. In order to gauge relative replication efficiency, values for ITR2 vectors were divided by the value obtained from the Rep2-ITR2 control. ITR5 vectors were compared to the Rep5-ITR5 control. All values were obtained in triplicate (n=3). Error bars represent standard error (standard deviation divided by the root of 3). All samples were compared to controls on the same blot.

Molecular Modeling—

Molecular models were generated using Swiss-Model (available at the expasy.org website). The published crystal structure of the N-terminus of Rep5 complexed with the RBE (PDB accession #1rz9) was used as a template for all models. Visualization of protein structure rendering of images were performed with PyMOL (available at pymol.org). DNA folding was performed using the DNA mfold server (available at mfold.bioinfo.rpi.edu).

Example 2 Construction and Characterization of Chimeric ITRs

Previously, AAV replicative specificity was postulated to be driven by the trs sequence (Chiorini et al. (1999) J. Virol. 73:4293; Chiorini et al. (1999) J. Virol. 73:1309). Rep2 can nick the ITR2 trs (AGT/TGG) and the AAVS1 trs of human chromosome 19 (GGT/TGG) (Wu et al. (2001) Arch. Biochem. Biophys. 389:271). Rep5 nicks only the ITR5 trs (AGTG/TGG). However, alignment of the ITR2 and ITR5 sequences revealed several significant sequence and structural differences outside the trs sequence (FIG. 1A). The spacing between the putative RBE and the nicking stem was significantly different; three nucleotides (nt) for ITR2 and 15 nt for ITR5. Additionally, while the trs sequence is not tightly conserved between ITR2 and ITR5, neither is the height or overall length of the putative nicking stem.

A novel method was used to generate mutant ITRs in order to determine which portions of the ITR were responsible for replicative specificity. Previous studies have investigated Rep-ITR interactions in vitro largely due to the difficulty of synthesizing full length ITRs for in vivo assays. PCR through the secondary structure of the ITR is inefficient and sequencing through these elements typically requires radiolabeled chain-terminator sequencing (Young et al. (2000) J. Virol. 74:3953). The AAV ITRs are highly recombinogenic and are frequently mutated even in a plasmid context (Samulski et al. (1983) Cell 33:135).

In order to address these concerns, the ITRs were synthesized and amplified in halves (FIGS. 36A-36C). To assemble the halves, a SfiI site was included in one of the hairpin arms of the ITR. SfiI allowed the conservation of the RBE′ sequence (Brister and Muzyczka (2000) J. Virol. 74:7762). Cloning the ITR in a double D element (DD) format required only one ITR per plasmid for replication (Xiao et al. (1997) J. Virol. 71:941). The three core Rep functions necessary for AAV replication (Rep binding, helicase, and nicking) were analyzed by the presence or absence of intracellular replication of the plasmid. This assay provided the ability to quantitate Rep-ITR function in a physiological setting, removing the concern that highly purified Rep protein might take on aberrant function in vitro. This system also avoided concerns that previous in vitro assays used only a fragment of the ITR or that oligos used to recapitulate the ITR might not fold correctly.

An alignment of ITR2 (SEQ ID NO:17) and ITR5 (SEQ ID NO:18) (FIG. 1A) revealed several divergent elements which might confer Rep specificity. The spacer and nicking stem elements appeared to be the most likely candidates for unique interactions with their cognate Rep protein. This hypothesis was supported by low homology of these elements between AAV2 and AAV5.

Wt ITRs containing the SfiI site functioned as expected with Rep2 specific to ITR2 and Rep5 specific to ITR5 (FIG. 1B). Rep2-ITR2 replicated approximately 2-fold better than Rep5-ITR5, potentially due to the lower folding energy of ITR5 resulting in reduced plasmid stability prior to replication. Due to this minor difference in replicative fidelity, all ITRs replicated with Rep2 were normalized to Rep2-ITR2, while ITRs replicated with Rep5 were normalized to Rep5-ITR5 (FIG. 1B).

In order to confirm that the RBE and hairpin arms played no role in Rep specificity, we generated a chimeric ITR with ITR5 binding elements and an ITR2 spacer and nicking stem (ITR5+2SNS, SEQ ID NO:19). Only Rep2 replicated this ITR, confirming the determinants of replicative specificity lie in the spacer/nicking stem elements (FIG. 1B). While ITR5+2SNS replication was not as efficient as ITR2-Rep2, it was replicated at ITR5-Rep5 levels. Conversely, Rep5 specifically replicated an ITR comprised of ITR2 hairpins and hairpin spacer and the ITR5 spacer and nicking stem (ITR2+5SNS, SEQ ID NO:20, FIG. 1B). Rep5 replicated this ITR at wt levels. These data demonstrated that Rep-ITR specificity lies outside of the ITR binding regions.

Next, chimeric ITRs were created to explore whether the nicking stem or the spacing between the RBE and nicking stem harbored unique interactions with the Rep protein. An ITR with the ITR5 binding elements and spacer and the ITR2 nicking stem could not be replicated by either Rep2 or Rep5 (ITR5+2NS, SEQ ID NO:21, FIG. 1B). The corresponding chimeric ITR (ITR2 binding elements and spacer with an ITR5 nicking stem) was replicated by both Rep2 and Rep5 (ITR2+5NS, SEQ ID NO:22, FIG. 1B). This disparity suggested that the spacer and nicking stem play different roles in Rep-ITR specificity between AAV2 and AAV5.

Example 3 The Nicking Stem is Important for ITR5 Specificity

ITR2+5NS (SEQ ID NO:22) established that Rep2 is capable of nicking an ITR with an ITR5 nicking stem and that Rep-ITR specificity is not driven exclusively by the trs sequence (FIG. 1B). In order to determine the flexibility of Rep2 toward mutant nicking stems, ITR2s containing altered forms of the hairpin were generated (FIG. 2A). Rep2 is able to replicate an ITR with an ITR5 nicking stem even though the ITR5 nicking stem contains a different trs sequence, is one bp shorter, and has two fewer unpaired nucleotides at its tip (FIG. 2A). The substitution of the ITR5 nicking stem into ITR2 also allowed replication by Rep5.

To determine which element of the ITR2 nicking stem prevented Rep5 activity, specific portions of the ITR2 stem were altered. First, one bp at the top of the putative ITR2 nicking stem was removed to lower the height to that of ITR5 (ITR2-TA, SEQ ID NO:23). Removing the T-A bp also resulted in a trs resembling ITR5, nicking between G/T opposed to T/T. Rep2 continued to function on this ITR as did Rep5, demonstrating that Rep5 can tolerate five unpaired nucleotides at the tip of the stem as long as the stem height and nt sequence are correct. A similar deletion from the base of the ITR2 nicking stem reduced the height to that of ITR5 while retaining the ITR2 nicking site (ITR2-GC, SEQ ID NO:25). Rep2 continued to function efficiently on this ITR while Rep5 activity was ablated. This data suggested that the inability of Rep5 to function on ITR2 is primarily the sequence of the trs, specifically the requirement for a nick to be generated between G/T.

To determine the extent of Rep2 flexibility for different nicking stems, three additional ITR2 mutants were created. Extending the nicking stem by one bp at the base had no effect on replication by Rep2 (ITR2 9 nt, SEQ ID NO:30). However, a three bp extension was sufficient to ablate Rep2 function on the ITR (ITR2 11 nt, SEQ ID NO:32). Surprisingly, Rep2 was able to tolerate a three bp deletion from the base of the stem, underlining the flexibility of Rep2 with respect to nicking stem substrates (ITR2 5 nt, SEQ ID NO:28).

In order to explore the level of flexibility Rep5 possessed toward non-wt nicking stems, a panel of mutant ITR5s harboring altered nicking stems were created (FIG. 2C). Curiously, Rep2 replicated none of these ITRs, suggesting an element outside the ITR5 nicking stem is responsible for preventing Rep2 function. As in FIG. 1B, replacement of the ITR5 nicking stem with that of ITR2 resulted in the ablation of replication by Rep5, attributable to the incompatible trs sequence. The addition of one bp at the top of the ITR5 nicking stem severely decreased the ability of Rep5 to replicate the ITR (ITR5+TA, SEQ ID NO:24, FIG. 2D). This insertion disrupted the ITR5 trs sequence and increased the size of the stem one bp. However, the low level of replication by Rep5 on ITR5+TA suggests that the entire trs site of ITR2 is necessary to confer Rep2 specificity, not just the presence of a T/T nick site.

The addition of one bp to the base of the ITR5 nicking stem, preserving the ITR5 trs at the tip, nearly eliminated replication by Rep5 (ITR5+GC, SEQ ID NO:26). Likewise, the removal of one bp from the base of the ITR5 nicking stem strongly decreased replication by Rep5 (ITR5 6 nt, SEQ ID NO:35, FIG. 2D). This data suggests that Rep5 is sensitive both to the height of the nicking stem as well as to the sequence of the trs. Thus, Rep5 is unable to replicate ITR2 because the ITR2 nicking stem is one bp too tall and has an incompatible trs sequence.

Example 4 Spacer Length is Important for ITR2, not ITR5

While Rep2 can replicate a vector with an ITR5 nicking stem, it can not replicate wt ITR5 (FIG. 1B). The only difference between ITR5+2SNS (which Rep2 can replicate) and ITR5+2NS (which Rep2 cannot replicate) is the ITR5 spacer (FIG. 1B). The wt Rep2 spacer is three nt long while the wt Rep5 spacer is 15 nt long. Thus, we hypothesized that Rep2 may be sensitive to spacer length.

To explore the effect of spacer length on ITR2 and ITR5, a series of mutant ITR2s and ITR5s with differing spacer lengths were generated (FIGS. 3A and 3C). An insertion extending the ITR2 spacer to 10 nt ablated replication by Rep2 (ITR2 10 nt, SEQ ID NO:31, FIG. 3B). Similarly, substitution of the ITR2 spacer with the 15 nt spacer of ITR5 also ablated replication by Rep2 (ITR2 15 nt, SEQ ID NO:33, FIG. 3B). Rep5 was unable to replicate any of these vectors due to the presence of the ITR2 stem loop.

Rep5 displayed greater flexibility toward spacer elements of differing lengths. Replacing the 15 nt ITR5 spacer with that of ITR2 resulted in an ITR in which Rep5 retained the ability to replicate at a reduced level (ITR5 3 nt, SEQ ID NO:34, FIG. 3D). Additionally, the presence of the three nt spacer allowed Rep2 to function on this ITR. The addition of six nt to the ITR5 spacer (for a total spacer length of 21 nt) resulted in an ITR capable of being replicated by Rep5 at an efficient level (ITR5 21 nt, SEQ ID NO:37, FIG. 3D). Replication by Rep5 was effectively abolished only after the insertion of 15 nt into the spacer (ITR5 30 nt, SEQ ID NO:38, FIG. 3D). This panel of mutant ITR5s demonstrates the importance of a three nt spacer element for Rep2 function.

This data confirmed that the length of the ITR5 spacer was important to block Rep2 function. Even small insertions into the ITR2 spacer were not tolerated by Rep2. Meanwhile, Rep5 is flexible in regard to spacer length, demonstrating the ability to function on ITRs with spacers from 3-21 nt.

Example 5 The ITR5 Spacer Acts as a RBE for Rep5

The inability of Rep2 to function on ITRs with spacers longer than three nt led to the question of why Rep5 was so flexible in this regard. It was hypothesized that Rep5 might specifically bind the ITR5 spacer just as it binds the RBE. The inability of Rep2 to bind this sequence would preclude its function on ITR5. Supporting this hypothesis was a moderately conserved GAGY Rep binding motif extending throughout the ITR5 spacer (FIG. 4A). Additionally, as Rep monomers bind every four nt, the binding of three Rep5 monomers to the 15 nt spacer element would result in a three nt spacer, similar to that of ITR2 (Hickman et al. (2004) Mol. Cell 13:403).

If Rep5 does bind the loosely conserved GAGY motif, the removal of that motif from the spacer should abolish Rep5 function. Indeed, the ITR5 No GAGY mutant (SEQ ID NO:40) could not be replicated by Rep2 or Rep5 (FIG. 4B). This suggested that the specific sequence of the ITR5 spacer plays an active role in the Rep5-ITR5 interaction. Conversely, a spacer with a pure GAGY repeat should not disrupt the ability of Rep5 to function on the ITR. Indeed, Rep5 was able to replicate this ITR at wt levels (ITR5 GAGY, SEQ ID NO:39, FIG. 4B). Rep2 was also able to replicate this ITR efficiently, suggesting the poorly conserved nature of the GAGY repeat within the ITR5 spacer prevents an important DNA-protein interaction with Rep2 necessary for replication.

To explore how the ITR5 spacer functioned as an RBE, we removed three GAGY repeats from the hairpin side of the RBE (ITR5 Spacer RBE, SEQ ID NO:42, FIG. 4A). This essentially shifted the 16 nt RBE 12 nt closer to the nicking stem. Rep5 replicated this ITR efficiently, confirming the ITR5 spacer acts as a RBE (FIG. 4B). The slight reduction in replication fidelity of this ITR, as compared with that of wt ITR5, may signal the inability of Rep to properly interact with the RBE′ (Brister and Muzyczka (2000) J. Virol. 74:7762). Rep2 was again unable to replicate ITR5 Spacer RBE due to its inability to interact with the ITR5 spacer.

Next, we sought to extend the ITR2 spacer element to function as an extended RBE (FIG. 4C). The seven nt insertion attempted in FIG. 3A possessed essentially no GAGY homology (ITR2+7, SEQ ID NO:29, FIG. 4C). As a result, Rep2 could not replicate this ITR (FIG. 4D). Eight nt (two four nt GAGY repeats) inserted into the ITR2 spacer between the RBE and the existing spacer (ITR2+8 GAGY, SEQ ID NO:41) prevented replication by Rep2, demonstrating that the ITR2 RBE cannot be extended. This suggests that Rep2 may be dependent on RBE′ binding or a specific spacer length for proper oligomerization to function on its cognate ITR. Curiously, this requirement does not apply to Rep2 function on ITR5 GAGY (FIG. 4A).

Similar to ITR5 Spacer RBE, we retained the eight nt GAGY insertion into ITR2 while removing eight nt of GAGY from the hairpin side of the RBE (ITR2+8−8 Spacer RBE, SEQ ID NO:43, FIG. 4C). This shifted the RBE eight nt closer to the nicking stem. Rep2 replicated this ITR very inefficiently at a level below the detection threshold of densitometric analysis (FIG. 4D, Southern).

Example 6 Identification of Regions in Rep Responsible for ITR Specificity

Identifying the two elements of the ITR responsible for Rep specificity allowed us to map the regions of Rep2 and Rep5 involved in ITR specificity. We focused exclusively on the N-terminal 208 aa of the large Rep proteins as this region encompasses the DNA binding and endonucleolytic activity of the protein (Yoon et al. (2001) J. Virol. 75:3230). This region displays approximately 60% sequence conservation evenly distributed across the protein sequence (FIG. 5A). Residues involved in the active site of the protein are 100% conserved between Rep2 and Rep5 (Hickman et al. (2002) Mol. Cell 10:327). Residues implicated in binding the RBE′ are highly conserved (Hickman et al. (2004) Mol. Cell 13:403). Residues which bind the RBE display nearly perfect conservation except for two conservative substitutions near aa 140.

In order to map the regions of Rep involved in ITR specificity, a panel of chimeric Reps derived from Rep2 and Rep5 were generated (FIG. 5B). The ability of each chimeric Rep to replicate an ITR2- or ITR5-flanked vector in HEK 293 cells was determined by Southern blot (FIGS. 5B and 5D). Each Rep in the panel was verified by DNA sequencing and Western blot analysis (FIG. 5C). Every chimeric Rep showed similar protein expression profiles compared to wt. Densitometric analysis provided a comparison of the replication efficiency of each chimeric Rep with that of wt Rep2 or Rep5 (FIG. 5E). Chimeric Reps were named according to the aa location of the swap between serotypes; for instance, Rep25aa77 (SEQ ID NO:63) possesses the N-terminal 76 aa of Rep2 and the C-terminus of Rep5.

In the case of Rep5, replacement of the N-terminal 77 or 97 aa with Rep2 had no effect on ITR specificity nor a significant impact on replicative fidelity (FIGS. 5D and 5E). Larger pieces of Rep2 substituted onto the N-terminus of Rep5 were sufficient to prevent efficient replication of ITR5s (Rep25aa116, SEQ ID NO:65; Rep25aa125, SEQ ID NO:66; Rep25aa141, SEQ ID NO:67). This suggested that these chimeras possessed interruptions of a critical region of Rep5 for ITR5 specificity.

Rep2-based chimeras were unable to replicate ITR5s without the inclusion of the N-terminal 146 aa of Rep5 (Rep52aa146, SEQ ID NO:79, FIG. 5D). Rep52aa146 replicated ITR5 at wt levels, as did the three chimeras with larger portions of Rep5 on the N-terminus (Rep52aa160, SEQ ID NO:58; Rep52aa175, SEQ ID NO:59; Rep52aa207, SEQ ID NO:61). This mapping reveals that the critical region for ITR specificity in Rep5 lies between aa 97-146. Surprisingly, the Rep52aa146 clone also functioned efficiently on ITR2, constituting a Rep capable of replicating ITR2 and ITR5. This suggested that ITR specificity existed in two different regions of Rep.

For Rep2, the N-terminal 83 or 109 aa of Rep5 could be substituted with no effect on ITR specificity or major influence on replicative fidelity (Rep52aa84, SEQ ID NO:54; Rep52aa110, SEQ ID NO:55; FIGS. 5D and 5E). Chimeras including slightly larger portions of Rep5 were unable to replicate either ITR, again suggesting the interruption of a domain critical for ITR specificity (Rep52aa126, SEQ ID NO:56; Rep52aa138, SEQ ID NO:57).

Rep5-based chimeras were unable to replicate ITR2s without the inclusion of the N-terminal 149 aa of Rep2. However, ITR2 replication was inefficient (Rep25aa149, SEQ ID NO:68, FIGS. 5D and 5E). The inclusion of larger portions of Rep2 allowed replication of ITR2s to increase to wt levels (Rep25aa166, SEQ ID NO:69; Rep25aa216, SEQ ID NO:71). This data maps the Rep2 region involved in ITR specificity to aa 110-149. However, unlike Rep5, this was not the only region which played a role in ITR specificity. The ability of the Rep52aa146 chimera to replicate ITR2 and ITR5 vectors demonstrated a second region of Rep2 between aa 138-160 sufficient to allow replication of ITR2s even when the other critical region (aa 110-149) was Rep5. The isolation of two different Rep regions involved in ITR specificity was consistent with the discovery of two independent elements governing specificity within the ITR.

Example 7 Characterization of Rep Regions Involved in ITR Specificity

To characterize the Rep domains identified in FIGS. 5A-5E, chimeric Rep proteins which specifically exchanged the regions implicated in ITR specificity were created (FIG. 6A). Region 1 existed in Rep2 from aa 110-149 and in Rep5 from aa 97-146. Region 2 lay within Rep2 from aa 149-187 and Rep5 from aa 146-187. As in FIGS. 5A-5E, all chimeras were verified by DNA sequencing and Western blot analysis (FIG. 6B). Chimeras were then assayed for the ability to replicate ITR2- or ITR5-flanked vectors (FIG. 6C).

Replacing Rep5 region 1 with Rep2 yielded a clone unable to replicate either vector, suggesting the chimera lacked the ability to bind the ITR5 spacer or nick the ITR2 nicking stem (Rep525aa110-148, SEQ ID NO:72, FIG. 6C). Replacing Rep5 region 2 with that of Rep2 allowed this chimera to replicate an ITR2 vector, suggesting region 2 of Rep2 was critical to nick the ITR2 nicking stem (Rep525aa146-187, SEQ ID NO:73). The inability of this chimera to recognize ITR5 is harder to explain as Rep52aa146 could replicate ITR2 and ITR5 efficiently (FIG. 5B). This result suggests that Rep2 region 2 makes specific contacts within Rep2 aa 188-208 which are necessary in order to function on the ITR5 nicking stem. Replacing regions 1 and 2 of Rep5 with Rep2 resulted in a Rep chimera which replicated only ITR2s (Rep525aa110-187, SEQ ID NO:74).

Replacing Rep2 region 1 with Rep5 resulted in replication of only ITR2s, again demonstrating a connection between Rep2 region 2 and the ITR2 nicking stem (Rep252aa97-146, SEQ ID NO:75). The lack of ITR5 replication by Rep252aa97-146 is difficult to explain based on the Rep52aa146 chimera which replicates ITR2s and ITR5s efficiently (FIG. 5B). This result suggests that Rep5 region 1 makes specific contacts within the preceding 96 aa of Rep5 in order to replicate ITR5. Replacing Rep2 region 2 with Rep5 resulted in a chimera unable to replicate either ITR (Rep252aa149-187, SEQ ID NO:76). This chimeric Rep possesses neither Rep2 region 2 (required to nick the ITR2 nicking stem) nor Rep5 region 1 which appears to interact with the ITR5 spacer. Finally, replacing both Rep2 regions 1 and 2 with Rep5 resulted in a chimera capable of replicating only ITR5 vectors (Rep252aa97-187, SEQ ID NO:77).

The crystal structure of the N-terminal 193 aa of Rep5 complexed to the RBE allowed the location of these two critical regions to be modeled (Hickman et al. (2004) Mol. Cell 13:403). The structure of the N-terminus of Rep2 was modeled with Swiss-Model software using Rep5 as a template. The location of region 1 supports its involvement with the spacer/RBE (FIG. 6D). This region interacts with the major groove of the ITR where one of the most apparent structural differences between Rep2 and Rep5 is predicted (FIG. 6D, hatched circle). Rep2 contains a two aa insertion in this loop with respect to Rep5. This insertion and other non-conservative substitutions are likely responsible for the inability of Rep2 to interact with the ITR5 spacer.

Viewing Rep along the length of the ITR illustrates that region 1 constitutes much of the base of the protein (FIG. 6E). Both Reps are predicted to participate in a β-sheet motif in the center of this region, while areas of reduced homology exist toward either side (the loop interacting with the major groove of the ITR on one side, RBE′ interactions on the other). A more detailed look at region 1 reveals the greatest disparity between Rep2 and Rep5 occurs at the RBE binding interface in the major groove of the ITR (FIG. 6F).

There is very little predicted structural difference between region 2 of Rep2 and Rep5 (FIGS. 6D and 6E). In an effort to dissect this region, we created two additional clones: Rep52aa147 (SEQ ID NO:81) and Rep52aa151 (SEQ ID NO:83) (FIG. 6A). Like Rep52aa146, both of these Reps were able to replicate ITR2 and ITR5 vectors (FIG. 6C). Rep52aa146 and Rep52 aa147 replicated ITR2 and ITR5 vectors with equivalent efficiency, suggesting E147 of Rep2 is not involved in ITR specificity. Rep52aa151 did display a modest reduction in ITR2 replication compared to Rep52aa146, suggesting that C151 of Rep2 plays a role in ITR2 specificity. Because Rep52aa160 cannot replicate ITR2, this leaves only two other non-conserved residues between Rep2 and Rep5 in this region (N155 and T161). Both of these residues lie near the active site and are likely to interact with the nicking stem or active site. N155 lies directly adjacent to Y156, the nucleophilic tyrosine, and may play a major role in ITR2 specificity (FIG. 6G).

Example 8 Structure-Function Model of Rep-ITR Specificity

In order to unify the ITR and Rep elements involved in specificity into a single model, the chimeric Reps separating region 1 and region 2 along with the chimeric ITRs separating the nicking stem and spacer were utilized. Rep2, Rep5, Rep52aa146 (which divides region 1 and 2 of Rep and can replicate ITR2 and ITR5), and Rep25aa149 (essentially no ITR2 or ITR5 replication) were selected. These Reps were tested for their ability to replicate ITR2, ITR5, ITR2+5NS (which is replicated by both Rep2 and Rep5), and ITR5+2NS (which is replicated by neither Rep2 nor Rep5).

Only Rep2 and Rep52aa146 efficiently replicated ITR2 (FIGS. 7A and 7B). Only Rep5 and Rep52aa146 replicated ITR5. As in FIGS. 1A and 1B, Rep2 and Rep5 replicated ITR2+5NS. Additionally, Rep25aa149 (SEQ ID NO:68) and Rep52aa146 (SEQ ID NO:79) replicated ITR2+5NS. This ITR appears to be universally replicated by every Rep in this assay due to the exclusion of DNA elements involved in protein specificity. The three nt ITR2 spacer is amenable to the DNA binding region 1 of Rep2 and Rep5. The seven bp tall ITR5 nicking stem functions with region 2 of Rep2 and Rep5. Thus, any combination of these regions constitutes a Rep protein capable of replicating ITR2+5NS.

Finally, neither Rep2 nor Rep5 replicated ITR5+2NS. Rep2 is unable to interact properly with the 15 nt ITR5 spacer. Rep5 is unable to function on the ITR2 nicking stem. For these reasons, Rep25aa149 was also unable to catalyze replication. However, Rep52aa146 was able to replicate this ITR due to the proper combination of Rep regions (FIG. 7C). Rep52aa146 possesses Rep5 region 1 which interacts with the 15 nt ITR5 spacer. This chimera also possesses Rep2 region 2, which functions on the ITR2 nicking stem. This recombinant DNA-protein interaction is unique from either AAV2 or AAV5 and constitutes a novel Parvovirus origin of replication.

Taken as a whole, this work illustrates two specific mechanisms of DNA-protein specificity at the Parvovirus origin of replication. Chimeric ITRs narrowed the DNA elements involved in specificity to the spacer and nicking stem sequences (FIG. 1B). These results contradicted previous assertions that Rep-ITR specificity were driven solely by the nicking sequence as Rep2 efficiently nicked an ITR harboring the ITR5 nicking stem (Chiorini et al. (1999) J. Virol. 73:4293). Rep2 is highly flexible in the sequence and height of its nicking stem while Rep5 is highly specific to its cognate stem (FIGS. 2A-2D).

Three residues of Rep2 are important to cleave the ITR2 nicking stem (FIGS. 5A-5E and 6A-6G). Residues C151, N155, and T161 all lie in the active site of the protein in a predicted alpha helix along with the nucleophilic tyrosine Y156. How these residues (termed Rep region 2) grant Rep2 flexibility toward mutant nicking stems remains unclear. The corresponding Rep5 residues (G148, A152, and V158) may participate in highly specific interactions which require specific height and sequence considerations for the ITR5 nicking stem.

AAV5 Rep-ITR specificity is mediated by the ITR5 spacer. Replacement of the three nt ITR2 spacer with the 15 nt ITR5 spacer ablated replication by Rep2 (FIG. 2B). A poorly conserved Rep binding element allows Rep5 to interact with the elongated ITR5 spacer (FIG. 4B). Mutating the spacer to include a strong Rep binding element allowed Rep2 and Rep5 to replicate the ITR. However, insertion of a Rep binding element into the ITR2 spacer still largely decreased Rep2 function. While this data might suggest that additional Rep5 molecules bind to ITR5, previous in vitro experiments have not come to this conclusion, although those studies were performed in the absence of hairpins on the ITRs (Chiorini et al. (1999) J. Virol. 73:4293).

A 49 aa region of Rep5 interacts with the ITR5 spacer (aa 97-146, FIGS. 5A-5E and 6A-6G). The crystal structure of the N-terminus of Rep5 reveals that this region (region 1) possesses residues which specifically bind to the RBE and RBE′ of the ITR. Major structural differences in the Rep5 loop which binds the major groove of the RBE likely account for the majority of ITR5 spacer specificity. While FIG. 1B predicts RBE′ binding should not play a role in Rep-ITR specificity, it is possible that RBE′ contacts alter the secondary structure of region 1 as it interacts with the RBE.

Because the regions of Rep conferring ITR specificity were separate (region 1 of Rep5 from aa97-146 and region 2 of Rep2 from aa151-161), a chimeric Rep possessing both regions was able to efficiently replicate ITR2 and ITR5. An ITR which could be replicated by any wt or chimeric Rep was constructed by excluding the DNA elements required for specificity; the ITR5 spacer and the ITR2 nicking stem. Most significantly, a novel origin of replication was generated. This ITR contained both of the elements for Rep specificity; the ITR5 spacer and the ITR2 nicking stem. As a result, only a chimeric Rep protein made up of Rep5 region 1 and Rep2 region 2 was able to replicate the ITR. The creation of a unique origin of replication highlights the power of studying the DNA-protein interactions of a viral origin of replication.

The creation of a unique DNA-protein interaction was possible because of the separation of the specific Rep-ITR interactions in AAV2 and AAV5. How and why these two different DNA-protein interactions evolved is unclear. It is likely due to evolutionary divergence in the ITR sequence which may have occurred in different hosts (AAV2 is related to other primate AAVs, AAV5 is related to non-primate AAVs such as goat and bovine). This model of replicative specificity can likely be extended to other parvoviruses such as snake AAV which has a highly conserved T-shaped ITR structure but different spacer and nicking stem lengths (Farkas et al. (2004) J. Gen. Virol. 85:555).

These results also stand to improve the safety of future AAV therapeutic vectors. The danger of AAV vector mobilization by wt AAV could be averted if therapeutic vectors harbored ITRs which no wt Rep could replicate (Hewitt et al. (2009) J. Virol. 83:3919).

Example 9 Snake ITR Vector Production

HEK 293 cells were cultured in Dulbecco Modified Eagle Medium (DMEM) supplemented with 10% Fetal Bovine Serum (Sigma, St. Louis, Mo.) and 100 units/ml penicillin and 100 μg/ml streptomycin and grown at 37° C. with 5% CO₂ saturation. To produce snake (royal python) ITR vectors, 10 μg of each of the following plasmids were transfected by PEI into HEK 293 cells in a 15 cm culture dish: pXX680 (Ad helper plasmid), pSnTR-eGFP (the ITR containing plasmid, SEQ ID NO:124), pSnRepCap2 (AAV helper plasmid containing the snake Rep genes and AAV2 Cap genes, SEQ ID NO:125), and pXR2 (AAV helper plasmid containing the AAV2 Rep and Cap genes). See FIGS. 33-35. Alternately, a plasmid expressing only the small AAV2 Rep proteins (Rep52 and Rep40) could be used in place of pXR2. 48 hours post-transfection, the cells were harvested and vector was purified by CsCl gradient centrifugation as previously described for other AAV vectors.

The foregoing is illustrative of the present invention, and is not to be construed as limiting thereof. The invention is defined by the following claims, with equivalents of the claims to be included therein. 

That which is claimed is:
 1. A synthetic large Rep protein comprising a first portion from a first adeno-associated virus (AAV) and a second portion from a second AAV; wherein one of the first AAV and the second AAV is AAV5 and the other is an AAV other than AAV5; wherein the synthetic large Rep protein functionally interacts with a synthetic inverted terminal repeat (ITR) comprising a first structural element from the first AAV and a second structural element from the second AAV; and wherein the synthetic large Rep protein does not functionally interact with a wild-type ITR from the first AAV or with a wild-type ITR from the second AAV; and wherein said first and second structural element is a nicking stem, a spacer or an RBE.
 2. The protein of claim 1, wherein said protein comprises a third portion and wherein the synthetic large Rep protein functionally interacts with a synthetic ITR comprising the first structural element, the second structural element, and a third structural element.
 3. The protein of claim 1, wherein said first portion comprises an amino acid sequence from about residue 97 to about residues 146-151 of a wild-type AAV5 Rep sequence.
 4. The protein of claim 1, wherein said second portion comprises an amino acid sequence from about residue 149 to about residue 187 of a wild-type AAV2 Rep sequence.
 5. The protein of claim 1, wherein said first portion comprises an amino acid sequence from about residue 97 to about residues 146-151 of a wild-type AAV5 Rep sequence and said second portion comprises an amino acid sequence from about residue 149 to about residue 187 of a wild-type AAV2 Rep sequence.
 6. The protein of claim 1, wherein said first portion comprises an amino acid sequence from about residue 1 to about residues 146-151 of a wild-type AAV5 Rep sequence and said second portion comprises an amino acid sequence from about residue 149 to about residue 621 of a wild-type AAV2 Rep sequence.
 7. The synthetic large Rep protein of claim 1, wherein the AAV other than AAV5 is selected from the group consisting of AAV1, AAV2, AAV3a, AAV3b, AAV4, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, AAV12, and AAV13.
 8. The synthetic large Rep protein of claim 1, wherein the first AAV is AAV5.
 9. The synthetic large Rep protein of claim 8, wherein the second AAV is AAV2.
 10. The synthetic large Rep protein of claim 1, wherein the first AAV is AAV2.
 11. The synthetic large Rep protein of claim 10, wherein the second AAV is AAV5. 